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CHANGES IN THE APPEARANCE OF STIMULI 
OF VERY HIGH LUMINANCE! 


TOM N. CORNSWEET 


University of California, Berkeley 


When quanta of light strike the 
retina, some of them are absorbed by 
the pigment molecules contained in 
the visual receptor ceils, and this ab- 
sorption ultimately results in the acti- 
vation of the visual nervous system. 
Recent developments in the biochem- 
istry of the visual pigments has given 
us a fairly clear picture of the micro- 
structure of quantal absorption. A 
theoretical analysis of this first stage 
in the visual process will be presented 
and this analysis will be shown to 
furnish a possible explanation of cer- 
tain striking changes in hue and 
brightness that occur when a stimulus 
of very high luminance is steadily 
viewed. 


THEORY OF VISUAL PHOTO- 
CHEMISTRY 


There is fairly good agreement that 
light triggers neural events by proc- 
esses of the following general type. 
Quanta of light, after passing through 
the other media of the eye, arrive at 
the retina where they may or may not 


' This research was supported by United 
States Public Health Service Grant Number 
B2169. The writer also wishes to thank 
Robert Abelson and Norman Anderson for 
their helpful suggestions, and James Thomas 
for his work on the brightness reversal 
experiment. 
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hit receptor cells. Those quanta that 
finally arrive in the region of the 
visual pigment of the receptor cells 
may or may not be captured by the 
molecules of pigment. If a quantum 
is in the close vicinity of a particular 
pigment molecule, the probability 
that it will actually be captured by 
the molecule depends upon the rela- 
tionship between the wave length, or 
energy, of the quantum and the physi- 
cal structure of the molecule and its 
surroundings. If all these probabili- 
ties are satisfied, the quantum is ab- 
sorbed by a visual pigment molecule. 
When, before absorption of the quan- 
tum, the pigment molecule was in a 
state that will hereafter be called the 
“regenerated”’ state, the absorption 
will cause the molecule to change its 
state (to what will be called an ‘‘un- 
regenerated” state). For example, 
Kropf and Hubbard (1958) suggest 
that the change in state is a cis-trans 
isomerization. The changing of state 
of a regenerated pigment molecule 
sets up a process in the receptor which 
eventually results in the discharge of 
a packet of some chemical mediator at 
a synapse farther along in the retina. 
When the concentration of chemical 
mediator at the synapse is great 
enough, a nerve impulse is triggered, 
and either that impulse or another 
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resulting from it, travels up the optic 
nerve. At the synapse, there is also a 
substance that destroys the chemical 
mediator and this, combined with 
diffusion effects, give the mediator a 
short “half life.’’ Therefore the prob- 
ability of triggering a nerve impulse 
depends upon the rate of discharge of 
packets. Above threshold, the fre- 
quency of nerve impulses will be 
greater, the greater is the rate at 
which packets of mediator are being 
discharged. Since each discharged 
packet is the ultimate result of the 
absorption of a quantum by a regen- 
erated molecule, the frequency of 
nerve impulses will depend upon the 
rate at which quanta are absorbed by 
regenerated molecules. 

The changes that occur in retinal 
photopigment when light is absorbed 
are reversible, and, if the eye remains 
in darkness long enough, virtually all 
of the pigment will have regenerated 
to its initial, ready-to-trigger, state. 
Some of the more recent evidence on 
cone pigments in vivo indicates that at 
least the main part of this regenera- 
tion process goes on at the same rate 
whether the eye is in darkness or in 
light, the rate depending only upon 
the concentration of unregenerated 
pigment present (Rushton, 1958). 

If the above general conceptions are 
correct, then, when a_ subject is 
steadily fixating a field of fixed lumi- 
nance, retinal pigment is both being 
broken down and regenerated, and the 
frequency at which nerve impulses are 
being fired in any one optic nerve 
fiber depends upon the rate at which 
regenerated molecules are _ being 
broken down. Hereafter, the term 
“output” will be used to denote the 
instantaneous concentration of chemi- 
cal mediator at the receptor terminus. 
This receptor output is thus directly 
related to the rate at which quanta are 
absorbed by regenerated molecules, 
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Proportion of Regenerated Pigment 











Time 
Fic. 1. Proportion of regenerated pig- 
ment as a function of the time after onset of 
the stimulus, for two different illuminances 
(B greater than A); and output as a function 
of the time after onset of the stimulus for the 
same two illuminances. 


and it is similarly related to the fre- 
quency of triggering of nerve im- 
pulses somewhere farther along the 
retinal activity chain. 

The upper curve, labeled A, in the 
upper part of Figure 1 is a generalized 
representation of the changes in pig- 
ment concentration that would occur 
during stimulation of a single receptor 
if the preceding considerations are 
correct. The receptor is completely 
dark adapted before the onset of the 
stimulus, so that the proportion of 
regenerated to total pigment is 1.0. 
At the time represented at the origin, 
a steady light is turned on. The 
pigment concentration will drop rap- 
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idly at first and then level off as it 
approaches some asymptotic level, 
the level at which the rates of break- 
down and regeneration are equal. 
The lower curve, labeled B, repre- 
sents the same events for an identical 
receptor under a more intense stimu- 
lus. 

The curves in the lower part of 
Figure 1 are a generalized representa- 
tion of the outputs (rates of pigment 
breakdown) of the same two receptors 
under the conditions of the upper 
part of Figure 1. Since the receptors 
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Fic. 2. Proportion of regenerated pig- 
ment as a function of time after onset of the 
stimulus, for two different illuminances (A at 
moderate illuminance and B approaching an 
infinite illuminance) ; and output as a function 
of time after onset of the stimulus for the 
same two illuminances. 
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are taken to be identical, and since 
at time 0 both concentrations are 
1.0, the initial output of B will be 
greater than the initial output of A. 
Further, since the time constants of 
regeneration for A and B are taken 
to be equal, the output of B must ap- 
proach a final level that is higher than 
the asymptote of A. This is true 
even though the equilibrium concen- 
tration of regenerated pigment is 
lower for B than for A. 

The upper part of Figure 2 repre- 
sents the regenerated pigment con- 
centration curves for the same two 
receptors when the illuminance on B 
has been raised almost to infinity, but 
the illuminance on A is the same as in 
the preceding figure. In the first 
short interval after the light is turned 
on, virtually all of the pigment mole- 
cules in B will be broken down. 
From that time on, each molecule 
that flips back to the regenerated 
state will immediately be broken down 
again. In other words, when the 
illuminance approaches infinity, the 
concentration approaches equilibrium 
extremely rapidly, and the equilibrium 
level depends only upon the time 
constant of regeneration of the pig- 
ment. The lower part of Figure 2 
shows the output curves for these re- 
ceptors, again when the illuminance on 
B approaches infinity. It will again 
be true that, when the stimuli are 
first turned on, B will have the higher 
output. Further, at equilibrium, B 
must still have the higher output, 
even though its regenerated pigment 
concentration is very much lower. 
In general, the equilibrium rate of 
breakdown of regenerated pigment 
(output) is greatest when the illumi- 
nance is infinity, since that rate equals 
the rate of regeneration and since the 
rate of regeneration will be greatest 
when the concentration of unregener- 
ated pigment is greatest. But, since 
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Receptor B will reach its equilibrium 
level almost instantaneously, its out- 
put curve must cross the output of A 
twice, as in the lower part of Figure 2. 
(This is necessarily true as long as the 
illuminance on A is great enough that 
its initial output level is greater than 
the equilibrium level for B.) Obvi- 
ously the illuminance on one of a pair 
of receptors need not be nearly in- 
finite in order that the pair exhibit a 
reversal of output relations. There 
will be a large set of pairs of illumi- 
nances for which reversal should 
occur. 


A More Speciric SET OF ASSUMP- 
TIONS AND THEIR CONSEQUENCES 


Now let us substitute, for each of 
the general conditions described above, 
a corresponding specific assumption 
about the operation of the visual sys- 
tem. From these assumptions, cer- 
tain quantitative aspects of the visual 
phenomena to be discussed may be 
derived, and the derivations examined 
in comparison with the data them- 
selves. 


If the absorption of a quantum by a 
particular molecule is independent of 
the state of the other molecules in the 
receptor, then it follows from statis- 
tical considerations that the rate of 
absorption by regenerated molecules 


1S: 


(1] 


(F) ~ AON 
i Q: 


where N is the number of regenerated mole- 
cules, A is the probability that a quantum 
incident on a region containing a regenerated 
molecule will bleach that molecule (the photo- 
sensitivity of each regenerated molecule at 
the wavelength of the incident quantum), 
and Q is the number of quanta incident dur- 
ing the Time dé. 


Recent studies of the bleaching of 
cone pigments in the living human 
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eye support the notion that, at least 
to a good approximation, the regen- 
eration of cone pigments is a first 
order chemical reaction (Rushton, 
1958). We will assume, accordingly, 
that the probability that any particu- 
lar unregenerated molecule will be 
regenerated in a unit of time will be 
independent of the states of the sur- 
rounding pigment molecules. It fol- 
lows then that the rate of regeneration 
of unregenerated molecules is: 


N 
(“ ) =K(M-—N) [2] 
dt J. 


where M is the total number of molecules, re- 
generated or not, in the receptor, N is, again, 
the number of regenerated molecules, and K 
is the probability that an unregenerated 
molecule will be regenerated in Time d¢. 


The net rate of change of regener- 
ated molecules is: 


ee LON 
7 = K(M — N) — A@ 


[3] 

We will now define the quantity X, 
the proportion of regenerated pigment 
in a receptor: 


[4] 


Substituting Equation 4 into Equa- 
tion 3 and dividing through by M: 


dX 


yp = Kl - X)- 40x [5] 


The solution to this differential 


equation yields: 


opine... 
~ K+AQ* K+AQ‘ 


x —UK+A® [6] 


when the initial condition is X = 1, 
that is, complete dark adaptation. 
In the preceding section, the output 
of a given receptor was defined as the 
rate at which quanta are absorbed by 
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regenerated molecules. Therefore: 


dN 
=> - — l j=, )) y, 4 
2 ( = ) AQN=AQMX 


K 
o=40m| x fot 


AQ 


K+AQ ° [7] 


war40 | 
This is an equation that relates the 
output of any given receptor to the 
irradiance and duration of a steady 
light falling upon it.2, The lower parts 
of Figures 1 and 2 are actually plots 
of Equation 7 for various values of 


Q. 
CONSEQUENCES FOR BRIGHTNESS 


It seems to be a reasonable assump- 
tion that the brightness of a lighted 
area will increase as the frequency of 
nerve impulses originating from the 
retinal region corresponding to that 
area increases. If this assumption 
and those concerning the conversion 
of quanta to impulses are correct, 
then the falling of the curves in the 
lower parts of Figures 1 and 2 repre- 
sent the fact that the brightness of a 
stimulus of fixed luminance decreases 
as the eye light adapts to it. 

The considerations illustrated in 
the lower part of Figure 2 also lead to 
the following prediction. If the eye 
is first dark adapted, then one area of 
the retina is illuminated (say with 
white light), and another area is 
simultaneously illuminated much 
more intensely, the more intensely 
illuminated area will look first brighter 


*None of the steps in this derivation is 
original. Pieces of it can be found in the 
writings of Troland (1930), Hecht (1935), 
Rushton (1958), for example. And, al- 
though the writer has not found the complete 
derivation all in one place in the literature, 
it may well be there somewhere. 
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Fic. 3. Times of brightness reversals as a 
function of retinal illuminance for the stimu- 
lus configuration shown in the upper right 
corner. (The surround was dark, the back- 
ground illuminated, and the horizontal bar 
consisted of a strip of 0.4 log unit neutral 
density filter. Each point is a mean of four 
judgments. ) 


than the other, then darker, and 
finally somewhat brighter again. 
That is, there should be a period of 
time during which the less intensely 
illurninated region should look brighter 
than the more intensely illuminated 
one. This paradoxical prediction has 
been tested and it is correct. The 
predicted brightness reversal does 
indeed occur, and is very easy to 
demonstrate. 

The data in Figure 3 were taken 
under the following conditions. The 
subject fixated a small bright point 
in the center of the field shown in the 
upper right corner of the figure. The 
field was shown in Maxwellian view, 
through a simple optical system whose 
effective aperture was 3.5. The 
source was a 32-candle power auto- 
mobile taillight bulb, run from a regu- 
lated DC _ supply. The filament 
image at the pupil was 1 millimeter 
in diameter, centered on the natural 
pupil. The luminance of the field 
was varied with neutral density filters. 
Heat glass was used to filter out the 
infrared. 
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When the field is first turned on, it 
looks like a dark bar on a bright field. 
But after a time that depends upon 
the luminance relations, the field ab- 
ruptly reverses, so that the bar looks 
considerably brighter than the back- 
ground. Then the brightnesses grad- 
ually converge and reverse again, 
until finally the background looks 
just slightly brighter than the bar. 

To collect the data in Figure 3, the 
following procedure was used on each 
trial. The subject’s viewing eye was 
dark adapted for 5 minutes. Then 
the stimulus figure was turned on and 
two clocks. started. The subject 
looked steadily at the fixation point, 
and at the first instant when the bar 
and its background were equally 
bright (changing from normal to re- 
versed brightnesses) he pressed a key, 
stopping one of the clocks. When the 
brightnesses were again equal (chang- 
ing from reversed to normal) he 
pressed a second key, stopping the 
second clock. The two times were 
recorded, and each point on the curves 
in Figure 3 is a mean of four such 
times. The area between the two 
curves in Figure 3 represents the 
times when the brightnesses are re- 
versed. At least 45 minutes elapsed 
between successive trials, and the 
order of illuminances was balanced. 


The second equality point is much 
harder to judge than the first one, 
and the judgments are less reliable. 
Part of the reduction in reliability is 
due to the fact that the changes are 


occurring very gradually, but, in 
addition, strong hue differences be- 
tween the bar and ground appear 
after the first 20 seconds or so; the 
background turns reddish and the 
bar less red. (There is not much 
point in trying to interpret these 
color differences physiologically, be- 
cause the wave length composition of 
the stimuli in this set of experiments 
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Fic. 4. Times of brightness reversals as a 
function of retinal illuminance. (For these 
data, the bar, as shown in Figure 3, was re- 
placed by an opaque strip. Each point is 
the mean of four judgments.) 


was very complex, due to absorption 
by the heat glass, etc.) 

The brightness reversal occurs 
equally strongly whether the fixation 
point is in the bar or the background. 
It cannot be explained by inhomo- 
geneities in the retinal structure. 

The data in Figure 3 were taken 
when the bar was .4 log units less 
intense than the background. (The 
bar was simply a strip of .4 density 
neutral gelatin filter pasted across the 
larger square aperture that formed 
the background.) But brightness re- 
versal also occurs when the bar trans- 
mits no light to the retina. The data 
for Figure 4 were taken after the strip 
of .4 density filter was replaced by an 
opaque strip. In this case, the time 
relations are less strongly dependent 
upon the illuminances (and the hue 
differences more extreme), but the 
effect is still very strong. The fact 
that the strip is opaque does not 
mean that the actual retinal illumi- 
nance in the region of the image of the 
bar is zero. There is extensive evi- 
dence in the literature to indicate that 
the stray light in this sort of stimulus 
situation is appreciable (see DeMott 
& Boynton, 1958, for an extreme ex- 
ample). Furthermore, there is a good 
possibility that the illuminances at 
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Fic. 5. Apparatus for a demonstration 
of brightness reversal. (The inset figure 
shows the subject’s view.) 


the edges of the figure (where stray 
light is greatest) are the critical ones. 
(This possibility is currently being 
tested.) Thus, the retinal condition 
in which an opaque bar is used in the 
apparatus is on a continuum with 
any in which a filter of transmittance 
greater than zero is used. 


It is easy to set up a demonstration of this 
brightness reversal, as illustrated in Figure 5. 
A strip of cardboard is placed across a round 
hole in a larger board. This hole is just a 
little smaller than the face of the sort of photo- 
flood lamp that has a frosted face and a 
built-in reflector (e.g.,GE #PH/RFL2). A 
small hole in the middle of the strip will serve 
as a fixation point (it is essential to fixate 
very steadily for this demonstration). The 
face of the photoflood lamp should be viewed 
through the hole and strip, and through heat 
absorbing glass, from about 4 or 5 feet away. 


AN EXTENSION OF THE THEORY TO 
CHANGES IN HUE 


When a subject steadily fixates a 
field of extremely high luminance in 
the long wave length region of the 
spectrum, the hue of the field under- 
goes very striking changes. Some of 
these changes have been mentioned 
independently by Auerbach and Wald 


(1955) and by Noddack and Jarzyk 
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(1955), in the course of studies of 
other color phenomena. Cornsweet, 
Fowler, Rabedeau, Whalen, and Wil- 
liams (1958) briefly reported a study 
of the color changes themselves. Ac- 
cording to all of those articles, the 
nature of the color changes is as fol- 
lows. The subject looks steadily at a 
field that is illuminated with light 
from the long wave length end of the 
spectrum. When the luminance is 
low or moderate, the field will look red 
or orange. However, if the lumi- 
nance is extremely high, it will look 
orange at first, but then rapidly shift 
in hue through yellow to a vivid green. 
This pattern of changes is reported 
to occur for stimuli of several different 
wave lengths in the “red” end of the 
spectrum, and for monochromatic as 
well as broad band stimuli. Stimuli 
with wave lengths in the green region 
of the spectrum are reported to pro- 
duce extreme desaturation, but no 
clear hue changes are reported. Auer- 
bach and Wald (1955) also report that 
bright blues will change to red, but 
such a change was not observed in 
either of the other two studies cited. 
A review article by Cohen (1946) pro- 
vides a number of references to other 
studies of the effects of high luminance 
stimulation. 

The theoretical considerations dis- 
cussed in the preceding sections sug- 
gest an explanation for some of these 
changes in hue. The explanation is 
based upon the retinal mechanisms 
already mentioned, along with some 
simple and widely held assumptions 
about the physiological bases for color 
vision. 

First, we will assume that there are 
at least two sorts of cones that differ 
in the pigments they contain, the two 
pigments having different but over- 
lapping absorption spectra. The ab- 
sorption spectra for such a pair of 
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Fic. 6. Generalized spectral absorption 
curves for two hypothetical retinal pigments. 
(The wave length 610 mu, indicated on the 
abcissa, will be specifically referred to later 
in the text.) 


cone types are shown in Figure 6. 
The two cone systems will be labeled 
R and G, as in Figure 6. Now con- 
sider the situation in which a subject 
is shown a stimulus and is forced to 
call it either red or green, no other 
alternative being permitted. Let us 
assume that the probability of his 
giving a “‘red’’ judgment depends upon 
the ratio of the outputs of the R and 
G systems. In the simplest case, 
when the R output is greater, he will 
be more likely to call the stimulus red, 
when the G output is greater, he will 
call it green, and when they are equal, 
he will judge it green half the time 
and red the other half. There is no 
theoretical necessity that the output 
ratio be 1.0 for 50% ‘‘red’’ judgments, 
but that particular condition will be 
assumed for much of the remainder 
of this paper, merely to make the dis- 
cussion easier to phrase. 

Figure 7 shows the results obtained 
when a subject is forced to judge a 
flash of monochromatic light as either 
red or green, as the wave length in the 


*The theory discussed in this paper re- 
quires that there be at least two cone systems, 
but does not approach the question of how 
many there may be altogether, 


region between red and green was 
varied by the method of constant 
stimuli. It should be noted that the 
slope of this curve is very steep, so 
that a change of just one or two 
millimu on either side of the 50% 
wave length will elicit reliable judg- 
ments of red or of green. Now if a 
subject is given a stimulus with a wave 
length right at the 50% point and 
asked what its hue really is, he says 
it is pure yellow. It follows from 
these observations that the last as- 
sumption discussed above is equiva- 
lent to assuming the following: when 
the R output exceeds the G output, 
the subject will see red, when the G 
output is greater, he will see green, 
and when the two are equal, he will 
see yellow. (This is of course true 
only for the spectral region between 
red and green. There may be other 
spectral regions in which the R and 
G outputs are also equal, but where 
the hue is given some other name, 
e.g., blue. The question of whether 
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Fic. 7. Distribution of judgments for 
constant stimulus method. (On each 1/4- 
second flash, the subject was given a forced 
choice between responses ‘“‘red” and “green.”’) 
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the yellow sensation is produced by a 
mixture of the R and G outputs or 
represents an excess of yellow over blue 
output with the red and green out- 
puts cancelling each other, is not 
relevant to any of the present dis- 
cussion. ) 

The explanation of some of the hue 
changes that occur to stimuli of high 
luminance follows directly from these 
assumptions. Monochromatic light 
in the long wave region of the spec- 
trum, for example at 610 millimu, will 
be absorbed by both the R and the G 
cone pigments, but since the absorp- 
tion coefficient for the R pigment is 
higher than the G at 610 millimu 
(see Figure 6), the R cones will ab- 
sorb a greater percentage of the inci- 
dent quanta. This is exactly equiva- 
lent to putting a greater illuminance 
on the R than the G cones. There- 
fore, if the R and G cones differ only 
in their absorption spectra, and a 
stimulus of very high luminance at 


610 millimu is given, their output 
versus time curves will look like those 
in the lower part of Figure 2, Curve A 
representing the G and B the R sys- 


tem. The stimulus should look red 
at first, then change through yellow 
(when the curves cross) to green, and 
finally slowly change back through 
yellow to orange. 

It has been reported in previous 
papers (Cornsweet et al., 1958) that a 
high-luminance red stimulus will ap- 
pear to change from red through yel- 
low to green. “No further color 
changes were observed, even though 
fixation was maintained for as long as 
three minutes” (p. 898). The present 
more extensive data at a variety of 
wave lengths, and at luminances 
greater than those used in the earlier 
study, clearly show that further 
changes do occur. In fact, the hue of 
the stimulus eventually changes back 
from green through yellow to orange 
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as is predicted by the theoretical 
argument above. This later set of 
changes happens to take appreciably 
longer than 3 minutes at the relatively 
low luminances and at the particular 
wave lengths used in the earlier 
study (Cornsweet et al., 1958). But 
at other wave lengths and luminances, 
all of the changes appear to be over 
within a minute or two (see the data 
in succeeding sections). 


GENERAL ASPECTS OF HUE CHANGES 


The general course of the hue 
changes that occur when viewing a 
high-luminance stimulus at 620 milli- 
microns is illustrated in Figure 8. 
This figure is drawn from a set of 
crude binocular matchings, in which 
the high-luminance stimulus was pre- 
sented to one eye and a matching 
stimulus of much lower luminance, 
whose wave length could be varied, 
was presented to the other. The 
curve in this figure should not be 
taken as anything more than a quali- 
tative indication of the phenomenon 
under discussion. Quantitative data 
of a more reliable and useful sort will 
be presented later. 
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Fic. 8. Wave length of a low luminance 
stimulus flashed to the left eye that matches 
the hue of a high luminance stimulus, at 620 
mu, steadily fixated by the right eye, as a func- 
tion of the time of viewing the high-luminance 
stimulus. 
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At this writing, the limits of the 
range of wave lengths for which hue 
changes occur have not been well 
established. However, the following 
general statements are true. High- 
luminance stimuli with wave lengths 
in the ‘“‘green-yellow’’ region of the 
spectrum (from about 560 to 580 
millimu) look green at first, turn a 
much deeper green, then back through 
yellow to orange, and finally back to 
yellow or greenish yellow. Any wave 
length between 580 and 655 mu 
will look orange at first, change 
through yellow to a good, saturated 
green, and finally back to an orangish 
yellow. It is probable that light of 
wave lengths longer than 655 mu also 
appears to turn green if its luminance 
is high enough, but 655 mu is the limit 
with the present apparatus. 

Stimuli below 560 mu have not been 
studied systematically, but in general 
they seem simply to desaturate to a 
white at the luminances available in 
the present apparatus. 

For stimuli in the region of about 
620 down to 560 mu or less, when the 
luminances are well above those nec- 
essary to produce the change to 
green, another change seems to be 
superimposed upon the first 15 or 20 
seconds of viewing. The field begins 
its normal course of hue change, but 
then abruptly turns to a desaturated 
pink or violet. It retains that ap- 
pearance for as long as 20 seconds, 
then changes back to orange, yellow, 
or green, and the normal sequence is 
resumed. This change to pink or 
violet has not yet been studied 
quantitatively. 

All of the changes in appearance 
that have been observed depend in 
character and timing upon the lumi- 
nance, and, of the wave 
length of the stimulus. These de- 
pendencies will be analyzed below. 


course, 
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QUANTITATIVE ANALYSIS OF HUE 


CHANGES 


The photochemical theory out- 
lined above requires that the time 
course of changes in hue be systemat- 
ically related to the luminances and 
wave lengths of the stimuli. In order 
to evaluate the theory quantatively, 
data have been collected on the time 
course of hue changes as a function of 
retinal illuminance for a number of 
wave lengths. Those data and their 


relationship with the theory will be 
presented below. 


PROCEDURE 


Judgments Used for the Collection of 
Quantitative Data 


The data presented here were collected by 
using two different sorts of procedures. The 
data for Figure 8 (and the data for the bright- 
ness reversals discussed earlier) are based 
upon a matching technique. Pairs of stimuli 
are found which differ physically but are 
judged the same along some perceptual di- 
mensio:: (hue in Figure 8, hue and brightness 
for the brightness reversal data). The other 
procedure used here is a sort of hue naming 
or identification. For example, most of the 
data in this report result from recording the 
times at which a subject judges the hue of 
the stimulus to be pure yellow, that is, neither 
reddish yellow nor greenish yellow. 

In situations where a choice may be made 
between matching and hue naming techniques, 
it is usually far preferable to employ matching 
(Brindley, 1960). Hue naming was used 
principally here for two reasons. First, for 
the particular phenomenon studied, the prob- 
lems introduced by matching are considerably 
worse than those of hue naming. For ex- 
ample, to determine the time at which a 
stimulus of high luminance produces a hue 
match with a standard hue (e.g., yellow) the 
matching field must be at a low enough lumi- 
nance that its hue is not changing very much. 
Therefore, the luminance of the matching field 
should be at least 2 log units lower than of the 
test field. However, the stray light from the 
test field then approaches and perhaps even 
exceeds the direct illuminance of the matching 
field. If the matching field is presented to 
one eye and the test field to the other, the 
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effects of the stray light are not eliminated, 
but just rendered less estimable. The collec- 
tion of the data represented in Figure 8 took 
advantage of the fact that, even when the two 
fields are projected on to noncorresponding 
parts of the two retinas, rivalry occurs. That 
is, if the stimulus of high luminance is pre- 
sented continuously and the matching field is 
occasionally flashed to the other eye, the only 
thing visible during the flash seems to be the 
matching field. It is then possible to find a 
test flash wave length that is the same hue as 
the high-luminance field at a given time after 
the onset of the high-luminance field. How- 
ever, this procedure clearly involves so many 
factors of unknown effect (stray light, rivalry, 
transients from the flash, etc.) that it is at 
best useful only to give a rough idea of the 
sorts of hue changes that do occur. 

\ different sort of matching technique 
might have been used in this study. Instead 
of determining the time at which the intense 
field matches one of a fixed hue, the subject 
might have been presented with two patches 
having different wave lengths and high but 
different Then as the patches 
both changed in hue (and brightness), the 
time when they match could have been deter- 
While not all pairs of stimuli would 
reach a match, there must be sets of them 
that would do so. This kind of matching 
procedure would not be so severely affected 
by stray light, and in many ways is preferable 
to the color naming Brindley 
(1960) has clearly pointed out that this type 


luminances. 


mined. 


procedure. 


of matching procedure (or at least one almost 
identical with it) enables one to draw physio 
logical conclusions from psychophysical data, 
without requiring what he calls “psycho- 
physical linking hypotheses.’”” However, it 
is also true that, when such a procedure is 
used, psychophysical linking hypotheses can- 
not be tested, and one of the aims of this 
paper is to examine just such a question, 
namely, what are the physiological correlates 
of the condition under which a subject re- 
ports the hue as red half the time and green 
the other half. The matching technique 
however does have many advantages over 
color naming as long as psychophysical link- 
ing hypotheses are not in question, and studies 
using strict matching procedures are now in 
progress. 

On the other hand, the procedure that con- 
sists of recording the instants in time when 
the subject calls the stimulus neither reddish 
nor greenish yields very reproducible results, 
and results which may be interpreted in a 
relatively straightforward way. 


Apparatus 


The apparatus used to collect all of the 
color data in the present study is shown sche- 
matically in Figure 9. The source, S, was a 
tungsten-coil-filament lamp (GE PH/18A/ 
T10P), run from a constant voltage trans- 
former at 6 volts, 18 amperes. Lens Li 
formed an image of the filament on the en- 
trance slit, En, of a Bausch and Lomb grating 
monochromator of 500 millimeter focal length, 
with a 600 lines per millimeter grating, blazed 
in the visible. The optics of the monochro- 
mator imaged the entrance slit (and the fila- 
ment) on the exit slit, Ex, with an equivalent 
aperture ratio of 4.4. An airspaced, coated, 
achromatic lens, Le, focal length 150 milli- 
meters, f = 1.9, imaged the exit slit in the 
plane of the pupil of the subject’s eye so that 
he saw a Maxwellian view of the back surface 
of the lens Lz. The entrance and exit slits 
of the monochromator were set to widths of 
1.5 millimeters, resulting in stimuli with 
spectral band widths of 5 millimu. The 
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Fic. 9. Apparatus for delivering 
intense monochromatic stimuli. (See 
for description of specific elements.) 


very 
text 
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length of the exit slit was also set to 1.5 milli- 
meters. Therefore, since Lens L2 gave unit 
magnification, the image at the pupil was a 
square 1.5 millimeters on a side. This is 
small enough that as long as the exit slit is 
imaged at the center of the natural pupil, 
changes in the size of the natural pupil will 
not produce appreciable changes in retinal 
illuminance. A _ neutral-density wedge, W, 
and filters, F, were used to adjust the lumi- 
nance of the stimuli. The wedge was close 
enough to the exit slit that no balancing 
wedge was needed. A clear piece of glass, G, 
reflected a small fraction of the stimulus light 
to a photomultiplier tube, PH, and the output 
of the phototube was used to monitor the 
luminance of the stimulus.‘ 

A Hilger thermopile and mirror galvanom- 
eter were used to make the energy calibra- 
tions. The thermopile was placed at the 
pupil position during these measurements. 
The spectral purity of the stimulus lights 
was measured with a second monochromator 
set at a 5 millimu band width. When the 
stimulus was at any of the wave lengths 
studied, no other band had a luminance as 
great as 1% of the stimulus luminance, and 
the total light at other wave lengths was less 
than 5% of the stimulus luminance. 

The subject’s head was held in position by 
a dental-impression bar. At his hand was a 
control panel containing push buttons which 
operated timers, so that the timing of any 
changes in appearance could be recorded. 

When the stimulus was on, the subject saw a 
circular field, uniformly illuminated, as illus- 
trated in Figure 10. The dark circular line 
in this field was produced by cementing a 
circle of fine wire to the back surface of Lz. 
This circle helped the subject to make his 


‘For the relatively short slit-length used 
here, the coil filament bulb delivered more 
energy through the system than did the rib- 
bon filament bulb that is ordinarily supplied 


with the monochromater. However, using 
the coil filament, the energy at the exit slit 
varied somewhat from day to day in an un- 
predictable way. This variability was prob- 
ably due to the fact that only a very small 
piece of the filament image was used, and, 
since the image is not uniformly intense, 
slight amounts of sagging of the coil or its 
supports could result in noticeable changes in 
output at the exit slit. For this reason, the 
monochromater output was 
monitored, and the intensity settings were 
made from the photomultiplier output rather 
than from the wedge or filter calibrations. 


continuously 


N. CoRNSWEET 


Fic. 10. Stimulus pattern as it appeared 
to the subject. (The entire 10-degree circu- 
lar area was uniformly illuminated with mono- 
chromatic light. The subject was instructed 
to fixate the center of the pattern and judge 
the hue of the inner disk.) 


judgments, in that he judged only the hue 
within the circle, ignoring the occasional edge 
effects that small eye movements produced at 
the outer margins of the field.® 
RESULTS OF HUE CHANGE StTupy 
The data plotted in Figure 11 show 
how the time taken to turn yellow 
varies as a function of the retinal ir- 
radiance of a monochromatic stimu- 
lus at 610 millimu. For example, at 
an irradiance of 3 X 10” quanta per 
second per square millimeter of retina 
(uncorrected for ocular transmission 
losses), the patch looked red or orange 
at first. The subject pressed his key 
to indicate that it was yellow (neither 
reddish nor greenish) after 17 seconds 
of viewing. He pressed a second key 
to indicate the second occurrence of 


5 The choice of a _ 10-degree 
stimulus crucial. The hue 
changes occur with fields as small as 15 min- 
utes of arc in diameter. (We have not tested 
still smaller sizes.) But the variance of the 
temporal aspects of the judgments is smallest 
when the field is large. 


particular 
disc is not 
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Fic. 11. Time to change hue as a function 
of the retinal irradiance of the stimulus, at 
610 mu. (Each point is a mean of two read- 
ings. ) 


yellow 100 seconds after the stimulus 
had first been turned on. The area 
between the two curves represents 
the times during which the stimulus 
looked green. 

The upper curve, for the second 
change to yellow, extends out to a 
retinal irradiance of 21 &K 10" quanta 
per second per square millimeter, the 
maximum irradiance available in the 
apparatus. The lower curve is not 
plotted beyond 3.9 K 10" quanta 
sec/mm? because the hue changes in 
that region were obscured by a change 
to desaturated pink or violet, as indi- 
cated roughly on Figure 11. For 
this wave length, the stimulus never 
appeared yellow at irradiances below 
1.2 * 10" quanta/sec/mm?’. 

Figure 12 shows the same sort of 
data for light of three other wave 
lengths in the long wave end of the 
The 600 
millimu are very similar to those for 


spectrum. curves for 
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610 millimu (Figure 11). As the 
wave length increases, the lower 
curves shift toward the higher irradi- 
ances, but retain the same general 
shape. The upper curve at 630 
millimu shows evidence of a peak at 
the lower retinal irradiances. The 
upper curve at 640 millimu exhibits 
a very large amount of variability, 
and is not plotted beyond 9 X 10” 
quanta/sec/mm? because the change 
back to yellow had not occurred dur- 
ing 210 seconds of continuous viewing. 
The variability in all cases increases 
very greatly as the time of viewing 
It will be 
shown that curves of this form may be 


becomes extremely long. 


expected from the theoretical con- 


siderations discussed previously. 
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Fic. 12. Time to change hue as a function 
of retinal irradiance for three different wave 
lengths. (Each point is a single judgment.) 
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RELATIONSHIP BETWEEN THE DATA 
AND THE THEORY 
The data in Figure 11 have been 
replotted as dots in Figure 13. They 
show how the time taken to turn 
yellow varies as a function of the ir- 
radiance of a monochromatic stimulus 
at 610 millimu: Every dot in this 
figure represents a time when the 
stimulus looked yellow. Therefore, 
according to the theoretical arguments 
outlined above, each of these points 
stands for a condition where the out- 
puts of the R and the G systems are 

in some particular ratio. That is: 


2, = CQ, rs] 


where C is the value of the ratio at 
which the hue is equally often called 
red and green in a forced choice. (In 
the preceding discussion C was as- 
sumed to be 1.0 for simplicity.) 
Substituting the photochemical 
Equation 7 for output in Equation 8, 
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Fic. 13. 


of retinal irradiance, for 


(Each point is a mean of two judgments. The 


smooth curve is a plot of the iterative solu- 


tions to the implicit Equation 10, using the 


parametric values listed in the text.) 
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and supplying the appropriate sub- 
scripts: 


1.0OM Be 
ee Ke+A:0 


AQ t(Kr+ArQ) | 


+k +4,0° ; 


pies Ky 
=ca,om,| +10 
0 seg 


i” 
K,+A,Q° 


where ¢ is the time of occurrence of yellow. 
[his equation may be rewritten: 


-+ 


t(Kg+ AgQ) | [9] 


K,A, 
K,+A,0 


AZQ 

K,+4A,Q‘ 
K,A, 

Pere 


AgQ 
K,+A,Q- 


((Krt+ArQ 


+4. 
4. 


t(Kg+AgQ | [10] 


where J = M,C/M,. 

A crude iterative procedure has 
been used to find the values for the 
five parameters of this equation that 
yield a good fit to the data. The 
two smooth curves drawn in Figure 
13 are a plot of Equation 10, when 
the following values of the param- 
eters are used: 


time constant of regeneration of red system, 
K, = .010 sec 

time constant of regeneration of green system, 
K, = .0083 sec™ 

photosensitivity of red system at 
A, = 1.70 X 10-"% 
photosensitivity of green system at 610 mu, 
A, = 1.32 X 1075 

ratio of amount of green to red pigment multi- 
plied by the output ratio for yellow judgment, 
J =1.18. 


610 mu, 


These preliminary values may be 
compared with Rushton’s (1958) esti- 
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mates: 
K, = .008 sec 
K, = .008 sec 


A = .25 X 10°" 


for the cone system as a whole, based 
upon measurements of the bleaching 
of cone pigments by white light, and 
corrected for an ocular transmission 
loss of 50%. Assuming that only 
an R and a G pigment are present, 
and correcting for 50% transmission 
losses, the estimate of A for the entire 
cone system on the basis of the pres- 
ent data is: 


A,=1/.5A,.= 2[1—(1—A,)(1—A,) ] 
= 2(A,+A,) 
A,~ 6X 10-5 cm? 


The value of A, determined from the 
color reversal data agrees much more 
closely with that quoted by Brindley 
(1960). From psychophysical data 


(quite different from those presented 
in the present paper), he estimates 


that A, = 1.61 *10-% (p. 217). 
The writer has been unable to find 
estimates of the constant “J’’ in the 
literature. (This constant represents 
a psychophysical linking hypothesis, 
in Brindley’s terminology.) 

The agreement between these esti- 
mates is not perfect, nor is the fit in 
Figure 13. Its presentation is not 
intended to imply that the present 
formulation and _ these particular 
values are the “true’’ ones. The 
plot is meant to serve two purposes. 
First, it makes clear that the general 
formulation is consistent with the re- 
ported hue changes when put into at 
least this specific form. Second, it 
serves as a preliminary report on the 
extensive work now in progress at- 
tempting to evaluate, quantitatively, 
the specific theoretical arguments pre- 
sented here. This work consists in 
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making determinations of the best fit 
values of the parameters of Equation 
10 under a number of independent 
conditions. For example, if this pro- 
cedure is repeated at a number of 
different wave lengths in the red 
region of the spectrum, the best fit 
values of K,, K,, and J should be 
identical at each wave length, but 
the values of A, and A, should trace 
out the spectral sensitivity curves of 
the R and G systems. 

The quantitative procedure used 
for finding parameters of the color 
phenomena may be applied to the 
achromatic brightness reversal phe- 
nomenon as well. However, the par- 
ticular stimulus configuration used to 
gather the brightness reversal data 
presented in this paper do not lend 
themselves to straightforward analy- 
sis. For example, it is almost certain 
that the illuminances at the borders of 
a retinal region are more important in 
determining the apparent brightness 
of the region than are the illuminances 
in other regions; and, for a bar against 
a background, stray light and small 
eye movements have large and in- 
determinate effects on the effective 
illuminances at the borders. An ex- 
periment is currently in progress in 
which these factors are minimized, 
while the time course of brightness 
reversal is determined in much the 
same way as before. The results of 
this experiment should be directly 
analyzable in terms of the photochem- 
ical hypothesis presented above. 

Observations on the rate at which 
the hue of the stimulus changes are 
qualitatively consistent with the the- 
ory, as plotted in the lower part of 
Figure 2. That is, the first change 
through yellow passes quickly, the 
stimulus actually appearing to be yel- 
low for a very short time. The second 
change, where the theoretically de- 
termined output curves are running 
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more nearly parallel, occurs very 
slowly and gradually. This is borne 
out quantitatively by the fact that 
the variance of the judgments of the 
time of second yellow point is appre- 
ciably greater than the variance of 
the first yellow point. The bright- 
nesses of the stimuli also agree quali- 
tatively with the theory. Although 
brightness has not been measured 
for the monochromatic stimuli, it is 
always high at first, gradually drop- 
ping to a much lower equilibrium 
level. 

From the theory as it has so far 
been stated, it might be expected 
that, just as a red light turns green, 
a green light should turn red, and a 
yellow one should not change in hue. 
This would indeed be the prediction 
if it happened to be true that (a) the 
rates of regeneration of the red and 
green systems were identical, and (0) 
the density of red and green pigment 
But if 


molecules were also identical. 
one regeneration rate were greater than 
the other, high-luminance stimuli near 
the yellow region would, at equalib- 
rium, take on the hue of the system 


that regenerates faster. From the 
present findings, this hue seems to be 
red, but the data are not yet complete 
enough to draw firm conclusions 
about the relative rates of regenera- 
tion. 

The fact that yellow and greenish 
yellow stimuli change toward purer 
greens before changing back through 
yellow would follow directly from the 
theory if there is more green than red 
pigment present in the stimulated 
region of the retina. Consider, for 
example, light of a wave length that 
looks yellow when it is first turned on. 
This means that it is initially isomer- 
izing red and green pigment molecules 
at the same rate (again assuming that 
C =1 for convenience). However, 
if there are actually fewer red mole- 


cules present per unit of area, the rate 
of isomerization of red molecules will 
immediately become smaller than the 
corresponding rate for the green sys- 
tem, and the stimulus should turn 
green. This sort of effect is superim- 
posed upon the processes already dis- 
cussed so that, at the proper irradi- 
ances, an initially green stimulus 
should first get greener, then change 
back through yellow to red, and 
finally again through yellow to green, 
as it does. 

It was mentioned above that, when 
the wave length is not too long (in 
the “orange’’) and the irradiance is 
extremely high, the early stages of 
hue change seem to be obscured by a 
wave of desaturated pink or violet. 
This effect would be expected if 
short wave sensitive systems were 
present and did produce desaturation. 
In this case, during the early stages 
of adaptation, both the R and the G 
systems are so intensely bombarded, 
relative to the short wave sensitive 
systems, that the R and G outputs 
drop to values comparable with the 
short wave systems, and desaturation 
occurs. 

Note: One should exercise caution 
when viewing stimuli of very high 
luminance. Fechner permanently 
damaged his retinae by staring at the 
sun, and ophthalmologists can count 
on an increased patient load after 
solar eclipses. Since there have been 
very few systematic studies of the 
pathological effects of intense lights 
(see Whiteside, 1960), there are no 
clear rules on what should be avoided. 
In general, the sun should not be used 
as a source unless it is very strongly 
filtered, and any source that contains 
a large amount of infrared radiation 
should be filtered by heat-absorbing 
glass. The writer and many others 
who have observed the phenomena 
discussed in this paper exercised these 
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precautions and experienced no dis- 
comfort except for afterimages that 
last for a few minutes. 


SUMMARY 


Evidence concerning the absorption 
of light quanta by visual pigment 
molecules is used to derive a theoret- 
ical relationship between the param- 
eters of light incident on the retina 
and the output of the receptor cells. 
This relationship leads to the follow- 
ing prediction: when two retinal areas 
are illuminated, one very intensely 
and the other still more intensely, 
there will be a period of time during 
which the less intensely illuminated 
region will appear brighter than the 
more intensely illuminated one. 
Data verifving this prediction are 
presented. 

When the theoretical treatment is 
expanded to include certain assump- 
tions about retinal color mechanisms, 


it provides an explanation of some of 
the changes in hue that occur when a 
very intense colored stimulus is viewed 


steadily. Data on the temporal 
course of these hue changes are pre- 
sented, and their relationship to the 
theory is discussed. This kind of 
analysis is shown to be a possible 
means for the quantitative evaluation 
of certain physiological properties 
of the visual system. 
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PRACTICE AS A PROCESS OF SIMPLIFICATION 


MARSHALL B. JONES! 


United States Naval School of Aviation Medicine, Pensacola, Florida 


From time to time in the last 30 
years a psychologist has intercorre- 
lated successive trials in the practice 
of some task or other. And in every 
instance the correlations have ar- 
ranged themselves in a very definite 
pattern. 

The correlations which appear in 
Table 1 are a good example. The 
task here is the Two-Hand Coordina- 
tion Test, which requires the subject 
to move a pin around a clover shaped 


TABLE 1 
INTERTRIAL CORRELATIONS FOR THE 
Two-HANnpD CoorRDINATION TEST 








runway by the coordinated turning of 


two control handles. The subjects 
are 152 basic airmen; and the trials 
are 60 seconds long, with 30-second 
rest intervals between trials.” 

But notice the pattern. For each 
trial the correlations are highest in 
the superdiagonal (just above the 
main diagonal) and decrease mono- 
tonically as we move either upwards 

1 Opinions or conclusions contained in this 
paper are those of the author. They are not 
to be construed as necessarily reflecting the 
view or the endorsement of the Navy De- 
partment. 


2? For the use of these data I am indebted 
to E. A. Bilodeau. 


or to the right, until we come to the 
smallest correlation, which is in the 
upper right-hand corner of the matrix. 
There are a few inversions in the last 
two columns but they are minor, 
amounting in no case to more than .02. 

This pattern, this “superdiagonal 
form” as I will call it, can be seen in 
every practice matrix which has ever 
been published. It seems to be a 
universal property of practice that a 
trial correlates strongest with its 
neighbors and weakest with the first 
and last trials in the series (Edgerton 
& Valentine, 1935; Fleishman, 1953, 
1960; Fleishman & Hempel, 1954, 
1955; Fleishman & Parker, 1959; 
Greene, 1943; Houston, 1950; Melton, 
1947, p. 1022; Perl, 1934; Reynolds, 
1952a, 1952b; Viteles, 1933). 


PRINCIPAL POSSIBILITIES 


Now what are we to make of this 
fact? Structurally, there are three 
principal possibilities. The first, in 
an historical sense, is the Woodrow 
two-factor structure. 

In 1939 Herbert Woodrow observed 
that every practice matrix which was 
known at that time could be explained 
in terms of two common factors with 
positive weights which increased on 
one factor and decreased on the other 
as one went through the practice 
series. In other words, he posited 
that: 


= a,F, + bi Fo + e151 
= oF; + beFs “fF CoSe 
os a3F; + b3 Fe + C353 


a,F; -t b, Fs a Cada 
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where the X; were the trial scores, F; 
and F, were the two common factors, 
the S; were specific factors, and load- 
ings, a, and b,, obeyed these laws: 


0< as a2S-°-< an i< an 


and 
bi> be> b3>---> b> 0 


We now know that the Woodrow 
structure cannot explain all practice 
matrices. Sometimes, particularly in 
long matrices, a third Woodrow factor 
is required, or even a fourth. Never- 
theless, the great bulk of practice 
matrices can be explained in terms of 
two Woodrow factors, and the present 
matrix is a case in point. If we 
hypothesize that: 


.6 5 = .92 by = on bs = .24 
94 b=.! be = .19 


94 bs =. b; 19 
94 by = .39 bg = .19 


we get the correlations which appear 
in Table 2. Clearly, the fit is good. 
So, as far as the correlations are con- 
cerned, the Woodrow structure is a 
tenable hypothesis. 

The second principal possibility is 
the ‘“‘double accumulation structure.” 
For an_ eight-variable matrix the 
equations are: 


n=f/itf 
X2=fi +fe 


tfhstfitfstle +1 +5; 

Siatse 
+fiotfistss 
tfutfiatfiatse 


tHhstfitlstle +/r 


s=fiththtithtle 
a=fitfeotfetlitls 


Ns=fithethsths thiotfuthfiatfistss 


Xe=hithet/s tfhothothiutfetfistse 
X7=fit/e Hystfothiotfiuthiathistsz 


X*8=f; thetfothiotfiutfietlistses 


where the trial scores are in standard 
form and all factor scores are in devia- 
tion form only.’ In 


some respects 


5 In the double accumulation structure and 
the simplex the right-hand side of each struc- 


TABLE 2 


THEORETICAL CORRELATIONS FOR THE Two- 
HAND COORDINATION TEST UNDER A 
Wooprow HyporuHesis 


| | 
X2| Xs | Xe | Xe | Xe 
|.78| .77 |.75| .70 $8 | 
.89 | .87| .83 | .81/. 
91 | .88 | .87| . 
89 | .89 | 


91 


double accumulation resembles Wood- 
row structure. Both hypotheses call 
for a two-process interpretation of 
practice, in which one process gains 
in importance with practice and the 
other loses. They differ, however, in 
the way these two processes are 
conceived. In the Woodrow struc- 
ture each process is represented by a 
single factor with weights which de- 
crease or increase as practice proceeds. 
In the double accumulation structure 
each process is represented by a set of 
factors in which the included factors 
accumulate or drop out with each new 
trial. 

The double accumulation structure 
is a special case of the ‘open contiguity 
structure,’’ and there are some prac- 
tice matrices which require the more 
generalform. In the present example, 
however, if we hypothesize that 
oy = .70 and that: 
o/,2=.00 o/,2=.02 


os," = .01 


2— ()? 
8 o7,,°=.02 


o;,2 =.03 o,,2=.04 


o7,2=.00 o;,2=.02 ay, 


o72=.03 o;,7°=.02 o;,,2=.02 o,,,2?=.12 
tural equation should be multiplied by an 
arbitrary constant. Otherwise, the structure 
conditions the variances and covariances as 
well as the correlations. Ordinarily, these 
constants are dropped as their inclusion would 
only complicate the equations to no particular 
advantage. Whenever we deal with vari- 
ances or covariances, however, these con- 
stants must be reinstated. 
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TABLE 3 


[THEORETICAL CORRELATIONS FOR THE Two 
HAND COORDINATION TEST UNDER A 
DouBLE-ACCUMULATION HYPOTHESIS 


Trial] Xi: | X2| X: | Xe} Xs | Xe 


| 82 | 
| 86). 
88 | 


the double accumulation structure 
generates the correlations which ap- 
pear in Table 3. In this instance, 
therefore, as in most practice matri- 
ces, the correlations are easily ex- 
plained in terms of a double accumu- 
lation hypothesis. 

The third principal possibility, the 
simplex, differs fundamentally from 
the first two in that it calls for a one- 
process interpretation of practice. 
Practice, under a simplicial interpre- 
tation, is a process of simplification, 
exclusively. For an _ eight-variable 
matrix the equations are: 

Xi=fitfetfstfatfstfetfitfstsi 
Sot fat fatfstfotfitfet se 

fatfatfotfoetfitfstss 


x2>= 


Sitfetsz 
fstss 


where all variables are in deviation 
form only.‘ 


‘The requirement in the simplex that all 
variables be in deviation form only is im- 
portant. Were we to require that the trial 
scores be expressed in standard form or, 
which is the same thing, that the trial scores 
all have the same variance (except for the 
arbitrary constants), then the “simplex” 
would become a single accumulation structure 
—which is a perfectly valid structure—but it 
has no place in the context of practice because 
it does not generate a superdiagonal form. 
For more on the single accumulation struc- 
ture, see Jones (1960, pp. 50-51). 


B. JoNEs 


To generate specific correlations 
from a simplex we must hypothesize 
two kinds of parameters: the com- 
munalities 


Sete 


h? = 


2 
Oz, 


and the complexities 

(“4 5 oo cat) 

c ‘ _ “ o a , 2 

os, tes+ os,” 
In the present instance, if we set: 
h?=.91 ¢,=1.00 
he= .93 Cc2= .93 
h?= .94 C3= .90 
h?= .94 = 88 


C5=.86 
Co=.83 
C7=.83 
Cs=.83 


we get the correlations which appear 
in Table 4 and a good fit. This par- 
ticular example, moreover, is not at 
all exceptional. Every known matrix 
of correlations between trials of prac- 
tice can be fit simplicially. 

I should also point out that, right 
now, there does not exist any way of 
testing the significance of these fits. 
However, no one of these hypotheses 
leaves a residual correlation greater 
than .02, and we hardly need a test to 
tell us that residuals of this magnitude 
are not significant. Even if they were 
significant, the fact would be of no 
particular moment. True, the hy- 
potheses would have to be modified ; 


TABLE 4 


THEORETICAL CORRELATIONS FOR THE Two- 
HAND COORDINATION TEST UNDER A 
SIMPLICIAL HYPOTHESIS 


Xa | Xs Xo 


.75 | .73 | .71 
.85 | .8: 81 


86). 
87 
89 


.89 
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but the modifications would be so 


paltry in amount that nothing of any 
real consequence would be altered. 


MoLarR CORRELATIONAL ANALYSIS 


To the learning theorist the next 
question is obvious: Which of these 
three structures best expresses the 
true nature of practice? I intend to 
answer this question, but first | want 
to say a word or two about correla- 
tional analysis. 

For the last 30 years the treatment 
of correlation matrices in psychology 
has been completely dominated by 
factor analysis. There has been criti- 
cism, of course. In fact, there are a 
great many psychologists, perhaps 
even a majority, who won’t have any 
part of factor analysis. But for two 
generations now there has been no 
alternative to factor analysis. If a 
man calculated a correlation matrix, 
the only existing way of summarizing 
his result in structural terms has been 
the factor analytic way. In the last 
few years, however, this situation has 
changed. Beginning with Louis 
Guttman’s work on the simplex in 
1954 and continuing with my two 
monographs on correlational analysis 
(Jones, 1959, 1960), an entirely new 
discipline has been created, a com- 
pletely different way of treating a 
correlation matrix. 

This new discipline, molar correla- 
tional analysis, is coextensive with 
factor analysis in practice and ab- 
solutely incompatible with it in the- 
ory. The essence of this incompati- 
bility isa matter of approach. Factor 
analysis is an inductive method. The 
factor analyst starts with the correla- 
tions and then attempts by a perfectly 
mechanical computing regime to grind 
out his factor structure. He knows 
very well, of course, that there are a 
great many, substantially different 
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structures any one of which could 
generate the correlations. But the 
factor analyst has faith. Aside from 
their intercorrelations he may know 
nothing whatever about the variables 
which enter into the matrix. Never- 
theless, he remains quite confident 
that by estimating communalities, 
factoring the matrix, and rotating 
axes (according to his preferred 
method) he will arrive at the structure 
which is actually responsible for the 
correlations being what they are. 

The molar analyst is a much less 
believing creature. As he sees it, 
correlational analysis is a strictly 
deductive discipline. If we are given 
some hypothesis as to how the vari- 
ables are structured, then algebra 
alone can tell us what the correlations 
ought to be—but that’s all it can do. 
The construction of theories is a busi- 
ness for the psychologist, as psycholo- 
gist, not the statistician. All the 
statistician can do is to give us the 
means for tracing out the conse- 
quences of our thinking. He can tell 
us which patterns follow from which 
structures. He can catalogue the 
more important of these structures 
and these patterns. He can even 
show us how to fit a structure to a 
correlation matrix. But beyond this 
he simply cannot go. 

In itself, this difference would en- 
able us to construct an entirely differ- 
ent approach to the treatment of 
correlation matrices. But there are 
several other respects in which molar 
analysis differs fundamentally from 
the factor analytic tradition. The 
most important of these is the em- 
phasis which the molar analyst places 
upon correlation pattern. A correla- 
tion matrix is more than a collection of 
correlations. As often as not, these 
correlations, or a substantial number 
of them, display a definite pattern. 
We have already seen the superdi- 
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there are 
monotonic 


agonal form. But many 
others well: the hi- 
erarchy, for example, or circumplicial 
form, or rippled patterns, or tiered 
patterns, or three-cornered patterns. 
The factor analyst pays no attention 
to these patterns. To him it makes 
no difference how the matrix is 
patterned or even that it is patterned ; 
under all circumstances he factors the 
matrix and rotates axes according to 
his a priori convictions. To the molar 
analyst, however, these patterns are 
an essential part, indeed, the most es- 
sential part, of his empirical results.® 


as 


Another basic difference between 
molar and factor analysis has to do 
with the idea of structure. If his 
thinking requires it, the molar analyst 
can and does work with factor struc- 
tures of the conventional type, as for 
example, in the Woodrow two-factor 
structure. But he isn’t content with 
these structures. The factors of a 
conventional structure bear no organic 
relationship one to the other. They 
aren’t even mutually exclusive; each 
variable has so much of this factor and 
so much of that one, and that’s it. 
But factor structures can also be or- 
ganized, just as correlations can be. 
Consider, for example, the double 
accumulation structure or the simplex. 
These structures are much more than 
a list of factors. They are organized 
about principles, accumulation in the 


one case and inclusion in the other, 


’ When a correlation matrix comes off the 
machines, the variables which enter into it 
may or may not be ordered so as to reveal 
whatever patterns it contains. Ina simplex, 
for example, the variables must be ordered 
from simple to complex or vice versa if the 
superdiagonal form is to appear. If the vari- 
ables aren't ordered this way at first, then we 
must reorder them until we find the pattern 
In my “Molar Correlational Analysis” 
(Jones, 1960, p. 75) I describe a device which 
makes it much easier to rummage around in a 
correlation matrix this way. 


B. Jones 


which have a meaning altogether in- 
dependent of the individual factors 
which make them up. These same 
principles, however, also operate in 
psychological theory. Ideas like con- 
tiguity, inclusion, accumulation, or 
competition are constantly recurring 
themes of psychological thought. But 
if we are to use these ideas in thinking 
about correlational results, we must 
have factor structures which can 
express them. 

Altogether, the idea of molar cor- 
relational analysis is best understood 
through an analogy with curve fitting. 
In curve fitting we are presented initi- 
ally with a plot of points. Ordi- 
narily, these points have a shape, how- 
ever obscured it may be by error. 
This shape may be unimodal, bimodal, 
S shaped, U shaped, or what have 
you; but whatever it is, the first thing 
that the curve fitter does is to look at 
the plotted points. Then, knowing 
something about analytic geometry 
and the nature of the two variables 
which are being related, he leaps to an 
hypothesis. This plot, he says, can 
be generated by a parabola, or a 
straight line, or a power function. 
There is, of course, no guarantee that 
he is right. There are always an 
infinity of functional forms which can 
fit any set of plotted points. But the 
curve fitter picks out one of these as 
his personal notion of what the rela- 
tionship actually is. And finally, he 
attempts a quantitative fit. The fact 
that a plot of points is roughly U 
shaped does not mean that it can be 
fit by a parabola. First, we must 
posit values for the parameters of the 
parabolic function and generate a 
theoretical locus of points. Then, if 
the fit is good, we can conclude that the 


equation hypothesized by the worker 

is at least a tenable hypothesis. 
Molar correlational analysis follows 

exactly the same sequence of events. 
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The correlations correspond to the 
plotted points; the pattern of the cor- 
relations corresponds to the shape of 
the plotted points; and the molar 
structure corresponds to the functional 
equations of the curve fitter. Like 
the curve fitter, the molar analyst 
begins by looking at his data. Then, 
knowing what the variables are that 
are being correlated and knowing 
something about which structures 
generate which patterns, he leaps to 
an hypothesis. This matrix is gen- 
erated by a Woodrow structure, by a 
double accumulation structure, or by 
a simplex. Then finally, he attempts 
a quantitative fit. It he can posit 
values for the structural parameters 
which yield a good fit to the empirical 
correlations, then his hypothesis is 
one possible interpretation of the 
facts. If not, he can safely afford to 
reject it. 

From a technical point of view, the 
central act of molar correlational 


analysis is fitting a structure to the 


correlation matrix. And let me em- 
phasize once again that in this fitting 
the molar analyst is not looking for a 
structural hypothesis, as the factor 
analyst does when he factors the ma- 
trix and rotates axes. The molar 
analyst has already hypothesized his 
structure; his sole purpose is to deter- 
mine the best fitting values for the 
structural parameters. In the body 
of this paper I will not present fitting 
procedures for any of the structures 
we have under consideration. How- 
ever, | have appended a Supplemen- 
tary Note in which these procedures 
are taken up at length. 

Finally, let me put in a word about 
“simplicity.” When we hypothesize 
a Woodrow structure, a double ac- 
cumulation structure, or a simplex, we 
are speaking of the form of our hypoth- 
esis, in exactly the same sense that 
we speak of parabolic, linear, or log- 
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arithmic forms in curve fitting. The 
simplicity of the hypothesis is deter- 
mined by the number of data we have 
to explain and the number of arbitrary 
structural parameters we must hy- 
pothesize to explain them. In the 
Woodrow structure there are (2) of 
these parameters, two factors with n 
loadings (the arbitrary parameters) 
on each; in the double accumulation 
structure there are (2m-3) structural 
parameters, which in this case are the 
factor variances; and in the simplex 
there are (2m-1) parameters,® the n 
communalities and the (n-1) com- 
plexities (one of the complexities is 
arbitrary). All told, therefore, the 
double accumulation structure is the 
simplest explanation ; then comes the 
simplex, and lastly the Woodrow 
two-factor structure. 

Let me emphasize, however, that in 
molar analysis this whole question of 
simplicity is a purely technical detail. 
The molar analyst, unlike the factor 
analyst, lays no great philosophic 
stress upon simplicity. For him, the 
task which confronts the correlational 
theorist is to propose true theories, 
not simple ones. His problem is not 
to explain the correlations in this one 
matrix in the simplest possible terms 
but, rather, to integrate the correla- 
tions with what else he knows about 
the correlated variables. And accord- 
ing as he succeeds or fails in this di- 
rection, he expects his theories to 
stand or fall. 

®In the past I have not regarded the com- 
munalities in a simplex as structural param- 
eters altogether on a par with the com- 
plexities. My presentation in this paper, 
therefore, marks a definite change of position. 
The principal consequence is that a simplex, 
under this new position is not as simple a 
structure as I had previously made it out to 
be, not by half. It was this consequence, I 
suppose,—and a lingering whisp of the factor 
analytic miasma—which led me into my 
original error, but an error it incontestably 
was, 
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SIMPLICIAL ARGUMENT 


Let’s return now to the problem at 
hand, the superdiagonal forms of prac- 
tice and their structural interpreta- 
tion. At the present stage of the 
game the arguments for and against 
our three main hypotheses are far 
from decisive. In my opinion, the 
simplex has the best of it on the avail- 
able evidence, but we have a long way 
to go before any one of these hypoth- 
eses or, for that matter, any other 
explanation of the correlations can be 
regarded as definitely established. 

The case for the simplex rests on 
two main arguments. The first, and 
most weighty, comes from experimen- 
tal psychology. Since the days of 
Bekhterev, psychologists have known 
that learning involved the progressive 
focussing of behavior on the one re- 
sponse which was successful in secur- 
ing reward. At first the dog starts 
wildly in the stock. He makes all 
sorts of inappropriate responses; in- 
deed, initially, it is only by accident 
that he lifts his paw. As practice 
continues, however, his behavior grad- 
ually focalizes, until in the end the 
dog barely lifts his paw the amount 
required to avoid the shock. As we 
observe it, in other words, practice is 
a process simplification. How 
very natural therefore, that in its 
differential composition (which we do 
not observe) practice should also be- 
come simpler and simpler as one trial 
follows another. 

In making this argument I want to 
be quite clear that I am talking about 
practice, not learning. There are 
some forms of learning, problem solv- 
ing for example, which almost cer- 
tainly are not processes of simplifica- 
If Krechevsky (1932) had in- 
his 


of 
yi 


tion. 
tercorrelated successive trials in 


famous studies of ‘hypothesis’ forma- 


tion in rats, it is quite possible that 


B. Jones 


the correlations would have shown 
the superdiagonal form. But in this 
case the originating structure could 
hardly have been a simplex. The 
rats’ ‘“‘hypotheses,”’ after all, came and 
went. First they tried one thing and 
then another, first a spatial hypothe- 
sis and then a visual one. This is the 
kind of behavior which generates an 
open contiguum, a structure in which 
factors can come as well as go, not a 
simplex, in which they can only go 
(or come).’? By ‘practice’ I mean 
that portion of the learning sequence 
which follows the formation of the 
correct hypothesis. Learning doesn’t 
end just because the person or animal 
has finally understood the problem. 
There remains the “stamping-in,”’ the 
mastery of the problem. And it is 
this stamping-in, this practice, which 
the simplicial hypothesis requires to 
be a process of simplification. 

The second argument has to do with 
the external relations of early and late 
stages of practice. According to the 
simplicial hypothesis, the early stages 
of practice contain a variety of factors 
in addition to those which are most 
particularly required in the practice 
task itself. This hypothesis implies 
that, if we correlate the trials of prac- 
tice with materials which are categori- 
cally distinct from the practice task, 
then, on the average, the early stages 
of practice should be more strongly 
related to the test materials than the 
later ones. On the other hand, if we 

7 If a correlation matrix can be explained 
by a simplex with progressive simplification, 
it can also, as a matter of analytic necessity, 
be explained by a simplex with progressive 
complications. From a correlational point of 
view alone the possibility cannot be dismissed 
that the trials become increasingly complex 
with practice. I have rejected this notion 
for the same reasons I have accepted its in- 
verse of progressive simplification, namely, 
its contradiction by the overt behavior of the 
subjects and by the correlations of the trials 
of practice with external measures. 
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correlate the trials of practice with 
very similar materials, we should ex- 
pect just the reverse result: in general, 
the later stages should be more 
strongly related to the test materials 
than the early ones. 

Unfortunately, the only evidence we 
have which is relevant to these two 
consequences concerns psychomotor 
tasks. In this one area, however, the 
evidence is very clear-cut. When 
they used printed tests as the external 
measures, Adams (1953, 1957), Fleish- 
man (1960), Fleishman and Hempel 
(1954, 1955), and Reynolds (1952a, 
1952b) all found that the predictabil- 
ity of the trials on a psychomotor 
task generally decreased with prac- 
tice. On the other hand, these same 
investigators have also shown that the 
relationship between one psychomotor 
task and another usually increases 
with practice, on either one of the two 
tasks or both. And if we take an area 
more or less in between, say training 
criteria, we find a more or less neutral 
result : for example, the correlation be- 
tween the Air Force pass-fail pilot 
criterion and scores on the Complex 
Coordination Test neither increases or 
decreases with practice. Unless we 
make special assumptions, however, 
none of these results follows from 
either the Woodrow or the double 
accumulation structure. All told, 
therefore, the existing evidence on the 
relationship between trials of practice 
and external measures strongly sup- 
ports the simplicial hypothesis. 


IDENTIFYING FACTORS 


In all branches of study the rec- 
ognition of form is followed by the 


analysis into parts. Once the astron- 
omer has recognized a star he wants 
to know what it’s made of. Once 
the biochemist has isolated a sub- 
stance he wants to know it’s composi- 
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tion. And once the psychologist has 
decided, even tentatively, that prac- 
tice is a process of simplification he 
wants to know what the elements are 
which participate in this process; he 
wants to identify the simplicial factors. 

Now, the only way the psychologist 
can identify these f; is to administer a 
variety of other tests to the same sub- 
jects who worked on the practice task. 
Once this is done, he can correlate 
the reference tests with the stages of 
practice and, through these correla- 
tions, obtain the correlations between 
the reference tests and the simplicial 
factors. If these final correlations 
are high, then the factors are identi- 
fied. If not, we must try again, with 
other tests. 

To illustrate this logic let us rework 
the results of a recent study by E. A. 
Fleishman.* In Table 5 I have repro- 
duced the intertrial correlations on 


8 In his analysis of these data Fleishman 
threw both the trials of practice and the refer- 
ence tests into one grand matrix and per- 
formed a factor analysis. There is, I think, 
a fatal objection to this procedure, quite 
apart from Fleishman's use of factor analysis. 
It is that the structure of the practice task 
becomes dependent upon the test materials. 
If we use one battery of tests, we will get one 
structure for the trials of practice; if we use 
another, we will get a second; if we use a third 
battery, we will get still another. But it’s 
obvious, it seems to me, that the trials of 
practice on, say, the Rotary Pursuit Test 
have some structure, altogether apart from 
the tests we happen to administer. This 
structure doesn’t have to be a simplex. Asa 
factor analyst, Fleishman might very well pre- 
fer the Woodrow structure. But however 
we interpret them, it seems quite clear that 
there is only one true structure for a single 
set of variables. Under Fleishman’s ap- 
proach the factors we are trying to identify 
are a function of the materials with which we 
are trying to identify them. The only way to 
resolve this dilemma is the molar way, to 
analyze the intertrial correlations first, apart 
from the reference tests, and then, having 
arrived at a structural recognition of the 
factors, to attempt an indentification in 
substance. 
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TABLE 5 


INTERTRIAL CORRELATIONS FOR THE ROTARY 
Pursuit Test 


* Fleishman reported his results for the odd-num- 
bered trials only. 


the practice task, which was the 
Rotary Pursuit Test. This matrix 
has the superdiagonal form. And if 
we hypothesize a simplex and set: 


Cs5=.73 


h?=.86 h?=.96 c,=1.00 
- =.70 


h?=.92 h®=.94 c2= 
h3=.94 h?=.90 .69 


hg=.98 hs? =.82 , -3=.6/ 


correlations 
Since the 


we get the theoretical 
which appear in Table 6. 
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TABLE 6 
THEORETICAL CORRELATIONS FOR 
Rotary Pursuir TEST UNDER A 
SIMPLICIAL HYPOTHESIS 


THE 


7 | 9 | il | 13 | 15 
.72 | .66 | .63| .61 
83 | .77 | .73| .71 
.91| .85 | 80] .77 
91 | 86} .83 
91 


56 
65 
72 
77 





89 | .82 
91 | .84 
| .84 





























fit is good, our hypothesis is tenable. 
It remains to find out what the hy- 
pothesized factors are, not structurally 
(we already know that) but substanti- 
ally. 

In addition to the Rotary Pursuit 
Test Fleishman administered a battery 
of 17 reference tests, 7 paper-and-pencil 
tests, and 10 apparatus tests. Now, 

® The reader will notice that for the printed 


tests, Tests 1 through 7, the early trials are 
more strongly related than the late ones, 


TABLE 7 


CORRELATIONS BETWEEN THE 


General mechanics 
lool functions 
Speed of identification 


mM MN 


Instrument comprehension 
Visual pursuit 

\iming 

Marking accuracy 

Purdue assembly board 
Santa Ana dexterity 

Rate control 

Linear pursuitmeter 


Nm Ww 


mw Ww Ww 
OO 
So w 


Nh Ww 
eS 


Rate of movement 
Rotary aiming 
Reaction Time I 
rrack tracing 
Complex coordination 


lime II 


Reaction 


REFERENCE 


TESTS AND THE TRIALS OF PRACTICE 


Trials 


11 | 13 


19 
14 


.26 


Nw 


24 


= Wh hhh me ee 
Conmr tN 


~ 
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TABLE 8 


CORRELATIONS BETWEEN 


General mechanics 
Tool functions 
Speed of identification 
Instrument comprehension 
Visual pursuit 

Aiming 

Marking accuracy 

Purdue assembly 

Santa Ana dexterity 

Rate control 

Linear pursuitmeter 

Rate of movement 

Rotary aiming 

Reaction Time I 

Track tracing 

Complex coordination 
Reaction Time II 








if we assume that the test-trial cor- 
relations, which appear in Table 7, 
are mediated by the simplicial factors 
only, we can calculate the correlations 
between the tests and the simplicial 
factors (Jones, 1960, pp. 28-31, 80 
83). These last correlations, of course, 
are not necessarily exact, because the 
assumption under which they were 
calculated could be in error. How- 
ever, when the simplicial communali- 
ties are as high as they are for the 
Rotary Pursuit Test, ranging from .86 
to .98—and the test-trial correlations 
are as low as they are in this instance 

we run little risk of serious error in 
accepting the hypothesis of exclusive 
And under this 
hypothesis, the correlations between 
the reference tests and the simplicial 
factors take on the values which ap- 
pear in Table 8. 


simplicial mediation. 


whereas for the apparatus rests 8 
through 17, just the reverse is true. In this 
case, these tendencies are weaker than they 
are in any other published study, but even 
so they are still there. 


tests, 


THE REFERENCE TESTS AND THE SIMPLICIAL FACTORS 


Factors 


fs | fa | fe 


00} 06] -. 
—10} .03| —-. 
00} 12 
.20 .06 

















These values, unhappily, are low. 
Not one of them ranges beyond the 
40s and only three are that high. In 
this particular instance, therefore, our 
effort to identify the simplicial factors 
was a failure. Nevertheless, there is 
nothing the matter with the method; 
if we persist with it, then sooner or 
later we will find tests which correlate 
strongly with the simplicial factors. 
When this happens, we will have 
identified the f;. 


(CONCLUSION 


In the end, of course, hypothesizing 
structures and identifying their fac- 
torial elements will not suffice. If we 
really want to understand the super- 
diagonal forms of practice, there is 
nothing for it but to experiment with 
them. How would these patterns 
change as a function of distribution of 
practice, or of intermittent reward? 
How would they be affected by prior 
practice on a related task? Sooner or 
later we must augment the passive 
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with 
must 


arts of correlational analysis 
active experimentation. We 
theorize, of course ; and this theorizing 
will inevitably be correlational in 
character. But we must also ex- 
periment. 
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SUPPLEMENTARY NOTE 


In this note I propose to take up the 
fitting procedures for the three structures 
which were treated in the text of this 
paper, the Woodrow struc- 
ture, the double accumulation structure, 
and the simplex. As a necessary pre- 
liminary, I will also treat a few, more 
elementary forms. 

Mathematically, the 
any molar structure is by the method of 
Given a structural hy- 
pothesis, we know that the theoretical 
correlations, p;;, are some function of the 
structural parameters, p:, p2,--+, Pn. 
We could, therefore, proceed to minimize 


two-factor 


best way to fit 


least squares. 


the squares of the differences between the 
theoretical and the empirical correlations, 
i.e., between the p;; and the r;;. To do 
this we need only differentiate the ex- 
pression : 


ye (9:3 — 155)? 
tj 


and solve the normal equations which 
result. 

There is, however, a difficulty with 
this approach: The number of normal 
equations which must be solved in order 
to determine a best fit is prohibitive. If 
we use least squares, every fitting process 
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requires us to solve at least as many 
simultaneous equations as there are 
variables in the matrix. And this, in its 
turn, requires computing equipment of a 
high order. It is desirable, therefore, to 
find other, means of 
arriving at essentially the same results. 
In all the fitting procedures which | 
will describe the general process is the 
We begin by positing a tentative 
set of parameters, which we then adjust 
until we get what appears to be a best 
fit. These tentative values, of course, 
are not taken at random. Knowing 
what the structure is, we can determine 
tentative values which will require a 
minimum of adjustment. The only 
problem is to develop sound procedures 
for arriving at these tentative values. 


some less heroic 


same. 


SPEARMAN UNIT FACTOR 


Let us begin with the Spearman unit 
factor, i.e., with the hypothesis that the 
which enter into the matrix 
contain a single common factor. If the 
matrix is arranged so that the variable 
with the biggest loading on this one 
factor comes first, the variable with the 
next biggest loading second, and so on, 
down to the variable with the smallest 
loading, which comes last, the structure 
generates a characteristic pattern. This 
pattern, which I call a “monotonic 
hierarchy,” locates the biggest correlation 
in the upper left-hand corner of the 
matrix and the smallest in the lower 
right-hand corner with the others spread 
out in between. If the loadings are ar- 
ranged the other way around, i.e., with 
the smallest loading first and the biggest 
last, then the pattern will be reversed: 
The biggest correlation will be in the 


variables 


TABLE Al 


A Monotonic HIERARCHY 


xX 1 4 ; | xX S| 


.64 


AS A PROCESS OF 


SIMPLIFICATION 


TABLE A2 


SoME THEORETICAL CORRELATIONS UNDER A 
SPEARMAN UNIT-—FAcCTOR INTERPRETATION 


Xs Xs 





64 .66 
70 72 
73 





lower right-hand corner and the smallest 
in the upper left-hand corner. 

The matrix in Table Al displays this 
second form of the monotonic hierarchy. 
On a purely qualitative basis, therefore, 
the hypothesis of a Spearman unit factor 
is tenable. But before we can accept 
this hypothesis as a possible explanation 
of the correlations, we must first specify 
values for the factor loadings and at- 
tempt a fit. 

Under a Spearman 
empirical correlations: 


hypothesis the 


rij aia; 


where a; is the loading on the one com- 
mon factor of the ith variable in the 
matrix. It follows that: 


Ti27 13 , 
a, ~ — 
r23 
(Teste) 
a, ~ tinea 
T(n—-1)n 


In other words, working with the two 
“ends” of the matrix we obtain two 
tentative values for the first factor load- 
ing, a}. If we compromise these two 
values, we will get a ‘‘balanced”’ single 
value, which, in this instance, equals .76. 
Then, since: 
a 

a 
we can go ahead and determine the ten- 
tative values for the remaining variables. 
In the case at hand we get these values: 


a, = 716 a3 = .84 
ade .83 a4 = .87 


Generating the theoretical correlations 
from these parameters we get the matrix 
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which appears in Table A2. Ordinarily, 
we are able to improve upon the fit by 
adjusting the a;, but in this instance the 
fit is just about perfect as it stands, since 
the biggest residual correlation is .01, 
and only one is that big. It’s quite 
clear, therefore, that the matrix can be 
explained as originating in a single com- 
mon factor. 


AUGMENTED SPEARMAN 
Unit FAcToR 


By an augmented Spearman unit 
factor I mean a two-factor structure in 
which the loadings of all the variables on 
one of the two factors are equal. Cor- 
relationally, the effect of this second 
factor is to add a constant quantity, 
which equals the loading on this second 
factor squared, to all of the correlations 
in the matrix. Now, consider the ma- 
trix which appears in Table A3. Quali- 
tatively, the pattern it displays, i.e., the 
monotonic hierarchy, is consistent with 
an augmented-Spearman interpretation. 
It remains to be seen, however, whether 
or not a good quantitative fit is possible. 

In fitting an augmented Spearman hy- 
pothesis our first task is to determine the 
magnitude of the one loading on the 
Consider, therefore, the 
three “principal points’’ of the matrix, 
i.e., the upper left-hand corner, the upper 
right-hand corner, and the lower right- 
hand Under a mnonaugmented 
Spearman hypothesis: 


second factor. 


corner. 


Tio (n—1)n — Tin? n-1) ~ 0 


If this quantity is either negative or zero, 
then the fit cannot be improved by means 
of the additive constant (which must be 
positive). If this difference is positive, 


TABLE A3 


ANOTHER Monotonic HIERACHY 


X2 
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TABLE A4 
SoME THEORETICAL CORRELATIONS UNDER AN 
AUGMENTED SPEARMAN UNIT-—FACTOR 
INTERPRETATION 


me | Xs Xs 


X, 88 
Xe 
X3 
X 4 


however, as in the present instance, then 
by subtracting a fixed positive amount 
from all the correlations in the matrix, 
the theoretical correlations can be 
brought more nearly into line with their 
empirical counterparts. Now, if we 
subtract .10 from all the correlations in 
the matrix, the tetrad difference between 
the principal points remains positive. 
In fact, it remains positive even if we 
subtract .50 from all of the correlations. 
If we subtract .60, however, the differ- 
ence becomes negative. Suppose, there- 
fore, that we take .50 as our tentative 
value for b?, the one loading on the sec- 
ond factor squared. If we subtract this 
value from all of the correlations in the 
matrix and fit a Spearman unit factor to 
the residual correlations, in the manner 
already described, we get these values 
for our five parameters: 


a, = .66 a3; = 
a; = .58 a, = 


which generate the correlations which 
appear in Table A4. 
is no need to adjust the tentative values 
for the parameters. But again it should 
be understood that adjustments are 


ordinarily required. 


Once again, there 


HaLF—Wooprow Two 
FACTOR STRUCTURE 


In the half-Woodrow, two-factor struc- 
ture the loadings on one of the two factors 
are the same for the first half of the 
series, while on the other factor the load- 
ings are the same for the second half of 
the series. In an eight-variable struc- 
ture, for example, the loadings on the 
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TABLE A5 


A THREE-CORNERED PATTERN 


Xs Xe | Xi Xs 
.70 | .72 | .74 
.67 | .69 | .70 


| 
.65 | ‘ 
.64 | .65 re 
a 
| 


88 | .77 | .71 | .69 | 
| 5 | 
.62 

64 
.66 


72 
74 


.63 | .64 
.70 





.60 61 | .63 











first factor: 
Qi =—- ae - a3 > wm 
and the loadings on the second factor: 


Bs = Be = B; c= Bs 


For the rest of it: 


< az< as 


as5< a6 
and 


B,=> B.> B3> Bs 


just as in the full-Woodrow structure. 
Correlationally, however, the two 
structures are very different. The full- 
Woodrow generates a _ superdiagonal 
form, while the half-Woodrow generates 
pattern. Table AS 
presents an eight-variable example. The 
correlations are high in the three corners 
of the matrix and drop off as one moves 
from any one of them toward the middle 
of the last column, the middle of the first 
row, or the middle of the superdiagonal. 
A three-cornered pattern is naturally 
divided into three submatrices: the two 
apical triangles, which contain the cor- 
relations within the two halves of the 
series, and the central block, which con- 
tains the correlations between them. In 
the two apical triangles the correlations 
are generated by augmented Spearman 
hypotheses. So, we already know how 
to fit correlations. In fact, we 
already have. The two apical triangles 
of Table A5 are the same two monotonic 
hierarchies which we treated in the first 
two sections of this note. Therefore, we 
already have tentative parameters for 


a three-cornered 


these 


the half-Woodrow structure, namely: 


= .71 Bi = .66 
76 B. = 58 

.83 B; = .41 

= .84 Bs = .32 
ag = .87 B = .00 


where @ represents the first four loadings 
(all equal) on the first factor and B rep- 
resents the last four loadings (again, all 
equal) on the second factor. 

These loadings, of course, were derived 
exclusively from the two apical triangles. 
To this point we have taken no account 
of the correlations in the central block. 
To do so, however, we need only generate 
the block correlations from our prelimi- 
nary loadings and see what sort of a fit we 
get. If the fit is good, our preliminary 
loadings become final. If it isn’t, we 
must adjust the factor loadings so as to 
compromise the errors which they gener- 
ate in the apical traingles and in the 
central block. In the case at hand the 
fit is not good. But the remedy is easy. 
All we need do is assign 8 a somewhat 
higher value than zero. If we let 8 equal 
.20 and keep all the other loadings just 
what they were, we get the theoretical 
correlations which appear in Table A6, 
and a good fit. Of course, it might not 
have been possible to fit the central block 
and the apical triangles with a single set 
of loadings. If this had been the case, 
we would have had to reject the half- 
Woodrow hypothesis. 


TABLE A6 
SoME THEORETICAL CORRELATIONS GENER- 
ATED BY A HALF—WooprRow Two 
FACTOR STRUCTURE 





X; | Xa Xs 
.77| 71 | .67 
74 | .69 | .66 

.63 | .62 
.60 
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FuLL—Wooprow Two- 
FACTOR STRUCTURE 


Fitting a full-Woodrow structure de- 
pends upon one fact: If a correlation ma- 
trix is from a full-Woodrow 
structure, it is always possible to trans- 
form the correlations into a_ three- 
cornered pattern and to fit the trans- 
formed correlations with a half-Woodrow 
structure. Suppose, for example, that 
we multiply the loadings a; and }; in a 
full-Woodrow structure by k;, where: 


derived 


for the last m/2 variables. The result 
will be a half-Woodrow structure, with 
the loadings (in the eight-variable case) : 


Bi = by 


>) 
bs 


) Be 
qa 
Bs os( = ) 
a3 


a 
a &= ».( ss) Bs = bs 


a 
And the correlations which this new 
structure generates, call them r;;’, will 
bear a simple relationship to the correla- 
tions of the original matrix, namely: 


rij) = rijkik; 


It follows that to fit a full- Woodrow struc- 
ture we must do three things: transform 
the superdiagonal form into a three- 
cornered pattern, fit the three-cornered 
pattern with a half-Woodrow, and then 
transform the loadings for the _half- 
Woodrow back to the original full-Wood- 
row structure. 

Our first task, then, is to find values 
for the kj which are inversely propor- 
tional to the full-Woodrow loadings, ai, 
de, a3, a4, and bs, be, bz, bs. The word 
“proportional” is important, because it 


means that we have a great deal of lati- 
tude in transforming the superdiagonal 
form; more specifically, it means that 
the transformed correlations may be big, 
small, or middling, without any loss of 
rigor. As long as the new correlations 
can be fit by a half-Woodrow structure, 
whatever their absolute magnitude may 
be, then we can fit the original correla- 
tions with a full-Woodrow structure. 
But even so, how do we find a set of &; 
which will do the trick? 

Consider the correlations in the central 
block of a three-cornered pattern. They 
take this form: 


Xs Xe X Xs 
X, (aas+ 881) (aas+ 881) (aa7+881) (acsy+ 81) 
X2 (aas+BB2) (aae+BB2) (aa7+ B82) (acy+ P82) 
Xs (aas+ 883) (aae+f83) (aa7+ P83) (aas+ffs) 
X4 (aas+BBs) (aas+BBs) (aa7+ BBs) (acs+P8) 


which the reader will immediately rec- 
ognize as a perfectly additive linear 
pattern, as these terms are used in analy- 
sis of variance. What we seek, there- 
fore, are values for the k; which will trans- 
form the central block of the superdiag- 
onal form into this pattern. Now, we 
know from the nature of the Woodrow 
structure that: 


as< as<S 7 


as 
and 


Bi> B2> Bs=> Ba 


which means that the biggest r;;’ in the 
central block will be in the upper right- 
hand corner. The correlations will fall 
off as we go either to the left or down, 
until we come to the smallest r;;’ in the 
central block, which will be in the lower 
left-hand corner. We also know, how- 
ever, that the correlations in the central 
block of a superdiagonal form: have just 
the opposite pattern: The biggest corre- 
lation is in the lower left-hand corner 
and the smallest in the upper right-hand 
corner. It follows that the multiplica- 
tive constants, k;, must obey these in- 
equalities: 


and 
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where the first four constants are multi- 
plied only into the rows of the central! 
block and the last four only into the 
columns. Knowing this much there is 
no great difficulty in transforming the 
correlations so that they have the pre- 
scribed pattern. 

Take the correlations in Table AS, for 
example. By now it is quite obvious 
that the correlations in Table A5 were 
derived from the original correlations for 
the Two-Hand Coordination Test. To 
produce the correlations of Table A5 
each correlation r;; in Table 1 was multi- 
plied by k; and again by k; where: 


k, = 1.17 ks = .81 
.86 
.88 
kg = .81 = .90 


All we have to do now, therefore, is to 
divide the loadings of the half-Woodrow 
structure, the a; and §;, by their corre- 
sponding k;; and we have our tentative 
parameters for the original full-Woodrow 
structure, viz. : 


a, = .61 s= 94 b, = 56 b= 
a,=./5 ; 97 be = 61 b= 
b; = .48 by 
by = 40 dg 


a3 oO. 95 


a4 : = .97 


These values do not conform entirely to 
the requirements of the full-Woodrow 
structure. 0, should not be less than bo; 
nor should a; be less than as. Also, the 
fit, though good, improved. 
However, we can make adjustments for 
these things; and when we do, we get the 
loadings which are cited in the text and 
the correlations which are reproduced in 
Table 2. 


can be 


DouBLE ACCUMULATION 
STRUCTURE 


If a correlation matrix is generated by 
a double accumulation structure, then: 


rig Pint (rig —Tin) + (rin Tin) 


If the fit is perfect, the relationship is 


exact. In other words, the correlations 
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generated by a double accumulation 
structure have the same additive linear 
pattern which we remarked in the central 
block of the half-Woodrow pattern. 
This circumstance makes it possible for 
us to fit the correlations under a double 
accumulation hypothesis without ever 
specifying the parameters of the struc- 
ture, i.e., the variances of the factors. 
Let’s take a few correlations from the 
matrix in Table 1 and see how well they 
satisfy this linear, additive relationship. 


We have: 


87 ~ .70 + (.07 + .12) = .89 
91 = .70 + (.04 + .06) 
91 = .70 + (.03 + .18) 

= .85 ~ .70 + (.01 + .16) = .87 


It's quite obvious from these relation- 
ships that Table 1 can be accounted for 
as a double accumulation structure. All 
we have to do is to treat the correlations 
in the first row and last column as theo- 
retical correlations and adjust them until 
we get a good fit, which, in the present 
case, is hardly much of a chore. Once 
they are made, we have the theoretical 
correlations, pij, which appear in Table 
3, of the text. 

It remains, of course, to determine the 
variances of the factors. In a double 
accumulation structure the first and last 
variables in the matrix have only one 
factor in common and that one factor is 
common to all of the variables in the ma- 
It follows that the variance of this 
one factor equals rin. Now, apart from 
this general factor, a double accumula- 
tion structure contains 2(m — 2) other 
factors, which are divided into two ac- 
cumulation structures, each one 
taining (m — 2) factors. It happens, 
however, that the theoretical correla- 
tions in the first row of a double accumu- 
lation pattern are determined by one of 
these structures only, and the theoretical 
correlations in the last column are 
determined by the other, exclusively. 
This means that we can reel off the factor 
variances in a very simple way, once we 
know the theoretical correlations. If the 
factors are numbered the way they are in 


trix. 


con- 
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the text, we get: 


Ts,” = Pit — Pis 


‘> = pis — Pir 


From these formulae I derived the values 
of the o;,2 which appear in the text. 


SIMPLEX 


In a Woodrow structure the param- 
eters are all of the same sort, namely 
factor loadings; similarly, in the double 
accumulation structure, the parameters 
are also of the same sort, in this case 
factor variances. But in the simplex 
there are two different kinds of structural 
parameters. Under a_ simplicial hy- 
pothesis, the theoretical correlations: 


P 
pis = — Vheh? 
Ci 


where the c, are the complexities and the 
h? are the communalities. This para- 
metric duality creates a corresponding 
duality in the fitting procedures: We can 
specify the communalities first or we can 
specify the complexities first. The older 
method is to specify the communalities 
first. So I will begin there. 


Communality Method 


Using the communality method we fit 
a simplex in the following manner. 
First, we specify tentative values for the 
communalities. Then, the correlations 
are corrected for attenuation (by the 
communalities) and the resulting, unat- 
tenuated correlations fit with a perfect 
simplicial form.4' If the unattenuated 
correlations won't submit to a simplicial 
fit, it may be that the variables which 


Al The term “perfect simplicial form” is 
defined on the next page. For a fuller dis- 
cussion of the simplex and its terminology, 
see Jones (1959, pp. 11-15). 


B. Jones 


enter into the matrix are not structured 
simplicially. It may be that we 
used the wrong communalities. So we 
adjust the communalities. If, by these 
adjustments, we can bring the unattenu- 
ated correlations into simplicial form or 
an approximation to it, we have our fit. 
If we can’t, then the simplicial hypothe- 
sis must be rejected. 


also 


It’s quite obvious from this account 
that a great deal hinges on the original 
choice of communalities. If the matrix 
has simplicial form, then sooner or later 
we will find a fit. But if the values we 
posit for the communalities are suff- 
ciently far off, it may be much later. It 
would be nice if we had some way of 
knowing at the beginning what 
communalities would turn out to 
Unfortunately, in advance of trying 
them out, the best we can do is to specify 
limits for the communalities. It would 
not be reasonable, for example, to set 
the communality of a variable higher 
than its known reliability. Nor would 
it be resonable to set it so low that the 
correlation between two variables, after 
correcting for the communalities, became 
greater than unity. But within these 
limits there is no way of knowing in 
advance what the best-fitting values for 
the communalities are. 

Nevertheless, we can still posit values 
for the communalities and try them out. 
And this is just what we do. In practice, 
there are three basic ways of proceeding. 
One, we can set the communality of the 
ith variable equal to the highest correla- 
tion it has in the matrix. Two, we can 
equate the communalities with one kind 
or another of reliability (if we have any 
information on this count). Or three, 
we can set the communalities all equal to 
a given value, .90, say, or .80. Noone of 
these procedures has any a priori claim 
to being right. But they do represent 
so many ways of approaching the prob- 
lem, and in my experience they are good 
ways. I have fit a great many simplexes, 
and one or another of these approaches, 
with minor adjustments, has always led 
to a good fit, if one existed. 

In fitting the matrix for the Two-Hand 
Coordination Test I set the communali- 


these 
be. 
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ties of the trials equal to the largest cor- 
relation they had in the matrix. The 
next step was to correct every correlation 
in the matrix for its attenuation by the 
remaining, specific variance. In other 
words, I divided r;; by Vhj;h?. The ma- 
trix of unattenuated coefficients which 
resulted appears in Table A7. 

Now, once the correlations have been 
corrected for attenuation the equations 
for the simplex become: 


=fitfet--:tfaithSsa 
fates fait fn 


or just what they were originally, except 
that the specific factors are gone. In 
this form, the simplex lays down very 
demanding requirements on the correla- 
tions. If the unattenuated correlations 
r,;', really do form a simplex, then they 
must obey the fundamental law of “‘per- 
fect simplicial form,”’ i.e. : 


’ ’ ’ 
Vik = ij Vik 


For example, if riz’ = .90 and re3’ = .80, 
then r;;’ must equal .72. Clearly, if 
this is the case, the matrix will have the 
superdiagonal form—and then some. 

Of course, in practice the unattenu- 
ated coefficients never do have simplicial 
form, not exactly. Take the correlations 
of Table A7, for example. ry2’ times 723’ 
equals .93, which almost equals ri3’ = 


TABLE A7 


UNATTENUATED EMPIRICAL CORRELATIONS 
FOR THE TWO-HAND COORDINATION TEST 
IN THE COMMUNALITY METHOD 


Trial] x.) Xe | Xs | Xe | Xs | Xe | Xr | Xs 


Xx, Lif 87; .86) .8: 83) .81 


1 es 91) .91 


XxX, 
‘ 98) . 92 


98 


TABLE A8 


UNATTENUATED THEORETICAL CORRELATIONS 
FOR THE TWO-HAND COORDINATION TEST 
IN THE COMMUNALITY METHOD 


Trial] Xi) Xe | X6 Xi Xs 
xX 93 | 90 | .88 | .86 |. 83| .83 
Xs | 90 | .90| .90 
X; | 93 | .93| .93 
X4 95 | .95| .95 
X; | 97| .97| 97 
Xe i 1.00 
X71 1.00 
Xs 























.91, but not exactly. Our task, there- 
fore, is to find a theoretical simplicial 
form which comes as close as possible to 
the unattenuated pattern. In the case 
of Table A7 this task is easy. The cor- 
relations, which appear in Table A8 have 
perfect simplicial form, and, as the reader 
can see, they constitute a very close ap- 
proximation to the unattenuated empiri- 
cal correlations. 

To obtain the theoretical correlations, 
ci;, which I presented in Table 4 of the 
text I simply multiplied the correlations 
in Table A8 by the same communalities 
which I used to unattenuate the empiri- 
cal correlations. 

But what about the complexities? 
Here we are, with the p;; already derived, 
and we still haven't said anything about 
the complexities. The answer, of course, 
is that, without realizing it, perhaps, we 
have already posited the complexities. 
The complexity of the ith trial is nothing 
other than its theoretical correlation, 
before attenuation, with the first trial in 
the series; in other words, cj = ¢;;._ In 
the case at hand, therefore, the complexi- 
ties are the correlations in the first row 
of Table A8.4? 


42 The virtue of the complexities is that, if 
we know them, then by the fundamental law 
of perfect simplicial form we know all of the 
ci; and can, therefore, if we also know the 
communalities, generate all of the theoretical 
correlations p;;. However, the complexities 
are not alone in possessing this property. 
The gradations, i.e., the correlations ¢;,;41, 
are also capable of generating the entire set 
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Covariance Method*® 


The full 


(with progressive simplification) are: 


b(fi tes + fa # si) 


equations for the simplex 


x= 


where the &; are arbitrary constants. In 
the usual presentation of the simplex, as 
in the text of this paper, these constants 
are dropped, since they are of no con- 
sequence as long as we have to do with 
correlations only. The moment, how- 
ever, that we concern ourselves with 
variances or with covariances we must 
dredge these little things up and be most 
meticulous about them. 

Under a simplicial hypothesis, the co- 
variances between the variables: 


covi; = kiki (os; + ---+0;,) 
Suppose, now, that we multiply all of 
where 


The 


oan <a ko 

the entries in the ith row by —, 
k; 

k, is another arbitrary constant. 
new entries: 


covi;! = kokj(os7---+ 07,2 


And if we multiply the columns by the 
same quantities, we get the entries: 


cov;;! = k 2(o;;7 + +--+ os," 


From here we get the unattenuated, the- 


oretical correlations cj; immediately, 
because : 


(72 +--- ++ o7,?) 


k 
The only problem is to find the P 


of c,; and so are several other subsets of the 
ci. For a discussion of this point, see Jones 
(1959, p. 12) 

43 The first person to mention to me the 
possibilities of the covariance matrix in fitting 
the simplex to a correlation matrix was Ardie 
Lubin. My original reaction was very nega- 
tive and in the “Simplex Theory” (Jones, 
1959, pp. 79-80) I had some rather hard 
things to say about his suggestion. I have 
no course now, therefore, but to apologize to 
him for my intemperance: he was right and I 
was wrong. 


MarsHALt B. Jones 


TABLE A9 


THE COVARIANCE MATRIX FOR THI 
Two-HAND CoorRDINATION TEST 


Trial] X:| X2 | Xs | Xe | Xe | Xe 


X,| | 323) 331 | 329 
X; | 477 | 484 
B 3 5 3 6 
A 4 
Xs 
> 
z 


This problem, happily, is easily solved. 
Once we have multiplied through the 


he then all of the entries in 
k; 

any one column are equal. 
we set all of the entries in the last column 
equal to some arbitrary amount A, by 
multiplying the rows by: 


A ko / y 
ah (,,= 4 
COVin k; \ kaos,” 


rows by the 


So suppose 


If the simplicial hypothesis is correct, 
then all of the entries in any one of the 
other columns should be equal too, or 
If this is not the case, we can 
If it is, 


close to it. 
reject the simplicial hypothesis. 


k, 
however, we have the z, We sought. 
‘ 


In Table A9 the covariance matrix for 
the Two-Hand Coordination Test is pre- 
sented. Setting all the entries in the last 
column equal to 300 (except for rounding 


> 


errors), we get these values for our 


k, k, 


. 51 

ky . ks 

k, k, 
3 49 = 

ks k 6 


.84 


When 


Note that we get no value for : 

8 
the rows are multiplied by these values, 
we get the matrix which appears in Table 
A10. And the columns conform 
approximately to the simplicial require- 
ments, we know that we are on the right 


since 
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TABLE A10 


THE COVARIANCE MATRIX FOR THE Two- 
HAND COORDINATION TEST AFTER MULTI- 
k, 


PLYING THROUGH THE Rows BY k 
‘ 


Trial} X: | X:| X Xe | Xs | Xe | Xr | Xs 
X, | 276 | 316 | 297 300 
X2 | 275 | 311 | 291 301 
X;3 | 273 | 315 | 293 | 298 
X, | 314 | 291 300 
Xs | 288 | 299 

| 303 

| 300 


track. Now, multiplying through the 
columns by these same values we get the 
matrix which Table A111. 
And from this table, taking the average 
down the columns, we get: 


appears in 


os,") = 154 


+ oy") = 141 


k2(oy2 ++ +++ 072) = 129 
ko? (a7,2 +o7,? + o7,?) = 123 
ko? (as,27 + o7,2) = 119 


From these figures we immediately ob- 
tain the matrix of theoretical, unattenu- 
ated correlation coefficients (minus the 
first row and the last column) which ap- 
pearsin Table Al2. It would have made 


TABLE All 


THe COVARIANCE MATRIX FOR THE Two 
HAND CooRDINATION TEST AFTER MULTI 
PLYING THROUGH THE ROWS AND THI 


COLUMNS BY — 
k; 


| 
Trial] Xi] X2 | Xs | Xe 
| 154] 142 | 135 | 
139 | 135 
: 4 134 


Xi } 
X2 
X3 
X, | 
Xs 
X¢ 
X7 
Xs | 


293 


no difference, by the way, if we had 
equated the rows first and then multi- 
plied through the columns. We would 
have ended up with the same values for 
the ¢;;. 

Let’s not worry for the time being 
about the first row and the last column 
and proceed directly to the communali- 
ties. If a simplicial fit exists, we know 
that: 


Vig * C43 Vh eh? 


If, therefore, we divide the 7r;; by our al- 
ready obtained c,;, the resulting matrix 
should be a Spearman unit hierarchy, 
where the loadings for the several vari- 


TABLE A1i2 


UNATTENUATED THEORETICAL CORRELATIONS 
FOR THE TWO-HAND COORDINATION TEST 
IN THE COVARIANCE METHOD 


| Lal : 
| Xs Xs Xe X: | Xs 


Trial] X:| Xs | 


.89 | .88 
.93 | .92 
95 | .94 
.97 | .96 








equal Vhi. this 


process we find that: 


ables Carrying out 


= .92 he = .94 
= .94 h? = .94 


It only remains to fill things in for the 
first and last variables. 
The correlations in the first row: 


‘ij = C12€2; VhYh? 
or 
{i ¢ 
‘ij => 11C2; Vh? 
if we set 


l, = C12 Vhi? 


In other words, given what we already 
know, i.e., C2; and Vh?, for2< j< n — 1, 
the empirical correlations r,; are gener- 
ated by a single parameter. In the same 
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way, the empirical correlations in the last 
column: 

vin = leCis¢n—1) Vh?? 
where 


ln = C(n—1)yn Vhie 


and finally 


= Lil ne 7 


is to obtain 
The most 


All we have to do, therefore, 
tentative values for /; and /,. 


natural way to do this is to set: 


l = =» 
Vh? 


Vitcn 1)” 


and then solve for /; and/,. In the pres- 


ent case, we get: 


Altogether, then, we can pick any values 
we like for ¢1. and h,*, provided only that: 
Ci2 vVh;? = .82 
And in the same we can pick any 
values we like for cz73 and hs*, provided 


way 


only that: 
C73 Vhs? = .97 

Suppose, therefore, that we set: 

12 = .90 C13 = .99 

h;? 81 h;? == .96 
Remembering that the complexities are 
simply the c;;, we now have both our com- 
munalities and our complexities, and we 


JoNEsS 


TABLE A113 


ATTENUATED THEORETICAL 
FOR THE TwWo-HAND COORDINATION 
IN THE COVARIANCE METHOD 


CORRELATIONS 
TEs! 


| | 
Trial x, | X: xX xX Xs L Xe | Xr 


72 | .70 | .69 

86 | .83 | .82 
£89 | .86 | .86 | . 
| 88 | .87 | . 

91 | .90 

| (92 


-] 
| | 
| | 
| | 
| | 





can generate our theoretical correlations. 
The pi; for the Two-Hand Coordination 
Test appear in Table A13, and, as the 
reader can see, we have a very good fit. 
Also, if the reader will check the param- 
obtained by this method with 
those obtained by the communality 
method, he will see that they are essen- 
tially the same. 

In conclusion, let me say a word 
about the comparative 
the communality and the covariance 
methods. Ordinarily, the communality 
method is quicker and involves many 
fewer computations. But it can go 
awry. If the communalities of the vari- 
ables happen not to coincide, 
proximately, with the tentative 
which are given us by one or another of 
our three methods of approach, we can 
waste an awful lot of time looking around 
for a fit. The covariance method, in 
comparison, is sure. If a fit exists. you 
can always get it with the covariance 
method. 


eters 


two merits of 


even ap- 
values 
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NORMATIVE AND IPSATIVE MEASUREMENT 
IN PSYCHOLOGY * 


DONALD M. BROVERMAN 


Worcester State Hospital, Massachusetts 


In an earlier paper (Broverman, 
1961) certain differences between fac- 
tors obtained from standard R factor 
analysis methods (correlating tests) 
and Q factor analysis techniques (cor- 
relating people) were illustrated and 
attributed to the different types of 
scores used by the two techniques. 
The usual FR analysis uses “normative” 
scores, or scores which reflect devia- 
tions of individual performances from 
the group’s mean level of performance ; 
while © factor analysis typically uses 
“ipsative” scores, or scores which re- 
flect deviations of particular perform- 
ances from each individual's mean level 
of performance. Normative scores, 
then, are distributed about a norm 
characterizing a group, while ipsative 
scores are based on norms of indi- 
viduals. 


The present paper will attempt to 
show that normative and ipsative pos- 


sibilities of measurement occur not 
only in factor analysis, but, also, in 
most efforts to quantify behavior, 
whether in purely theoretical terms or 
in actual experimental situations. Ac- 
cordingly, the assumptions involved in 
each mode of measurement, and exam- 
ples of these assumptions in current 
psychological theories will be pointed 
out. The paper will then examine the 
manner in which standard statistical 


1 Preparation of this paper was facilitated 
by the Dementia Praecox Research Project, 
Worcester State Hospital, and research 
grant (M-896), from the National Institute 
of Mental Health, United States Public 
Health Service. The writer prepared this 
paper while a United States Public Health 
Postdoctoral Fellow in Clinical Psychology. 
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procedures cope, or fail to cope, with 
these different aspects of behavior. 
With respect to this last point, the 
classical correlational procedures will 
be shown to confound normative and 
ipsative aspects of behavior. Hence, 
the final portion of this paper will pre- 
sent correlational techniques for deal- 
ing with these distinctions. 


ASSUMPTIONS OF THE NORMATIVE 
MEASUREMENT ORIENTIATION 


The normative measurement orienta- 
tion assumes that behavioral variations 
between individuals may be ordered on 
parameters which exist within the 
species’ universe of variation. Thus, 
normative measurement procedures 
distribute scores about the norm of a 
group, sample, or population. In 
normative measurement, then, the be- 
haviors of individuals are ordered on 
dimensions which transcend the indi- 
vidual, i.e., the behavior of a given 
individual is but a point or mark on a 
normative scale. This approach as- 
sumes that certain individual differ- 
ences in behavior are due to an entity 
or trait which all members of the popu- 
lation possess in varying amounts. For 
example, Spearman (1927) assumes 
that individual differences in intelli- 
gence reflect the different amounts of 
mental energy possessed by different 
individuals. 

It should be noted that intraindi- 
vidual variation in parts of normative 
traits are not considered in normative 
measurement. Accordingly, less than 
perfect correlations between several 
manifestations of the same normative 
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trait are accounted for by assuming 
(a) that the surface behaviors showing 
the trait are phenotypically complex 
manifestations of more than one un- 
derlying genotypical, normative trait, 
and/or (>) that part of the discrepancy 
is due to measurement error. Thus, 
normative dimensions are treated as 
unitary, internally undifferentiated 
variables whose variation occurs across 
individuals. 


ASSUMPTIONS OF THE IPSATIVE 
MEASUREMENT ORIENTATION 


The ipsative measurement orienta- 
tion, on the other hand, assumes that 
each individual is, himself, a universe 
within which variations in behavior 
occur. Accordingly, the different be- 
haviors of an individual are ordered on 
dimensions or scales within each indi- 
vidual, e.g., the habit strengths of dif- 
ferent responses in an individual may 
be scaled and compared. Particular 
behaviors of the individual, then, are 
assigned marks or scores on ipsative 
scales. 


The ipsative measurement approach 
frequently assumes that relationships 
between psychic entities in an indi- 


vidual determine his behavior. For in- 
stance, Spence (1954) argues that 
when incompatible response tendencies 
are simultaneously aroused in an indi- 
vidual, the 
strongest habit strength will be the one 
the individual will make 

It should be noted that often the 
ipsative orientation is not concerned 
with the unique aspects of single indi- 
viduals as in the idiographic method 
advocated by Allport (1937). On the 
contrary, ipsative relationships are fre- 
quently ascribed in theory to an entire 
population, i.e., the rules or laws be- 
tween internal psychic events may be 
generalized to all humans. Thus, it is 
not contradictory to seek commonly 


response possessing the 
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occurring ipsative relationships. In- 
deed, this is the goal of many personal- 
ity theories. 

The ipsative measurement orienta- 
tion, then, is concerned with quanti- 
tative relationships between psychic 
functions within individuals. It fre- 
quently assumes that such intrapsychic 
relationships play an important role in 
the determination of behavior. 
PROCEDURES 


IPSATIVE MEASUREMENT 


A property of all scales, both ipsative 
and normative, is that the scale scores 
must be in the same units. Hence, in 
ipsative measurement, the various be- 
haviors of a given individual must be 
expressed in terms of some common 
unit if scale comparisons are to be 
Numerous examples in psycho- 
logical theory illustrating this pro- 
cedure are available. For instance, 
Spearman (1927), who maintains that 
intelligence has certain subfunctions 
such as eduction of relationships, re- 
production, etc., posits that a large 
expenditure of mental energy in one 
subfunction leaves less energy available 
for other subfunctions. In this ex- 
ample, the common unit of the dif- 
ferent subfunctions is mental energy. 
The energy levels of the two subfunc- 
tions, therefore, may be scaled and 
compared. Similarly Freud (1935) 
posits conflict between different psy- 
chic institutions, e.g., id versus ego, the 
outcome of which depends upon their 
respective strengths. Once more the 
common denominator is some sort of 
mental energy. 

Thus, in ipsative scaling procedures, 
different behaviors are reduced to, or 
expressed in terms of some common 
unit, and vary around a norm estab- 
lished within the individual. The 
norm, of course, is also expressed in 
these same units. 

Different ways of establishing ipsative 


made. 
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norms within individuals exist. One 
method is by working with single, iso- 
lated individuals. For example, in 
Stephenson’s (1953) Q sort each indi- 
vidual produces his own distribution of 
responses to a set of stimuli. The 
typical unit of variation in such Q sorts 
is the degree of “likeness to me.” An- 
other example would be measures of 
physiological reactivity under different 
conditions which are compared to a 
base-rate of that activity. In these ex- 
amples, the norms are derived solely 
from a given individual’s behaviors. 
What other individuals do has no ef- 
fect upon the scales or norms so estab- 
lished. Hence, this may be called a 
“pure” ipsative measurement method. 

A second way of establishing ipsative 
norms within individuals starts with 
normative considerations. For in- 
stance, the performances of a number 
of subjects might be normatively 
ranked for each test in a test battery. 
A given subject would then have a 
distribution of normative ranks—one 
rank for each of the tests in the battery. 
The subject’s general level of perform- 
ance for the battery, i.e., the subject’s 
average normative rank standing, 
would then constitute the subject’s 
ipsative norm and deviations of the 
subject’s normative rank standings on 
particular tests from this ipsative norm 
would represent ipsative scores. The 
ipsative scores, in this case, are in units 
of deviations of group standings on 
various tasks from the subject’s aver- 
age group standing. Ipsative scores 
derived in this manner may be termed 
“double-centered” scores, i.e., they are 
centered or distributed first about a 
group norm, then about a norm of the 
individual. 

Thus, unlike either the pure ipsative 
method or the normative approach, 
the double-centered ipsative approach 
takes into consideration both normative 


and ipsative aspects of variation. 
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Mopes oF MEASUREMENT VERSUS 
PARAMETERS OF BEHAVIOR 


Measurement operations refer to cer- 
tain ways of assigning values to events 
or data on hypothetical parameters. 
Measurement-wise, however, one may 
develop several different techniques for 
establishing such values, or scores, on 
a given hypothetical parameter. Thus, 
one may distinguish between the con- 
cept of “modes of measurement” and 
the hypothetical parameters upon 
which these measurement techniques 
are intended to produce scores. The 
lack of equivalence between particular 
measurement or scaling procedures 
and the hypothetical parameters upon 
which score values are desired may be 
illustrated by considering a situation 
where certain modes of measurement 
are inadequate for the task at hand. 
For instance, we have previously men- 
tioned that the normative measurement 
orientation does not consider intra- 
individual variations in subdomains of 
normative traits. Similiarly, the pure 
ipsative approach does not attempt to 
work with normative variations be- 
tween individuals. However, if one ac- 
cepts the notion that a given set of per- 
formance scores may reflect both 
normative and ipsative sources of 
variation, then both of the above 
measurement orientations become in- 
adequate as procedures for assigning 
appropriate normative and _ ipsative 
scale values to the data. Rather, 
some method which takes into account 
both sources of variation should be 
employed. Thus, the hypothetical 
parameters on which the data are to be 
given scale values are seen to be con- 
ceptually independent or superordinate 
to the particular manner in which the 
various marks on the parameters are 
Hypothetical normative 
and ipsative behavioral parameters, 
then, represent two distinct and sepa- 


developed. 
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rate areas of concern for theories of 
behavior. It is not surprising that 
theories of psychology have progressed 
down both of these avenues of inquiry. 


NORMATIVE AND IPSATIVE 
PsyCHOLOGICAL THEORIES 


Modern psychology includes both 
normatively and ipsatively oriented 
theories. For instance, Krech’s (1949) 
theory that differences in intelligence 
are due to different rates of cortical 
conductivity seems, essentially, a norm- 
ative theory. The dimensions involved 
in Sheldon’s (1944) body types also 
seem to be true normative parameters, 
as do the various traits of personality 
with which Allport (1937) has been 
concerned. 

However, most theories concerning 
the mechanisms of psychological func- 
tioning assume that behavior is a 
function of relationships between psy- 
chological events occurring within a 
given individual. For instance, Hel- 
son’s (1948) Adaptation Level Theory 
suggests that a perceptual response is 
determined by the relationship of the 
present stimulus to previously experi- 
enced stimuli. Similarly, Werner 
and Wapner’s (1952) Sensory-Tonic 
Theory stresses the role of the relation- 
ship of sensory to tonic factors in de- 
termining perception. Thus, the above 
perceptual theories, many learning theo- 
ries (Hull, 1943; Thorndike, 1932), 
and general personality theories 
(Freud, 1935; Lewin, 1935) all em- 
phasize the importance of relationships 
between psychic events in determin- 
ing behavior. These theories, then, 
are essentially ipsative in nature. Of 
course, many theories may include both 
normative and ipsative considerations. 
For instance, while Freud (1935) is 
primarily concerned with the relation- 
ships between psychic institutions, he 


also postulates normative, between in- 
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dividual differences in endowments of 
libidinal energy. Conversely, while 
Spearman (1927) is perhaps best 
known for the normative aspects of his 
theory, i.e., differences in intelligence 
are due to different amounts of mental 
energy, his theory also includes 
formulations about ipsative ‘relation- 
ships between subfunctions of intel- 
ligence. These theories, and quite 
likely others, then, contain both norma- 
tive and ipsative formulations. 

It should also be noted that the 
normative-ipsative distinction is not 
represented in the language of indi- 
vidual differences. Individual differ- 
ences are usually thought of only in 
a normative i.e., in Anastasi 
and Foley (1949) differences between 
individuals are thought of as “a matter 
of degree. It is in this sense that indi- 
vidual differences are said to be quanti- 
tative rather than qualitative” (p. 59). 
However, this orientation tends to 
obscure the possibility that certain 
between-individual differences may be 
a function of varying ipsative, within- 
individual relationships, e.g., a psycho- 
analyst might argue that the differ- 
ences between psychotics and normals 
are due to different intrapsychic rela- 
tionships between ego and id. On the 
other hand, individual differences stem- 
ming from differing ipsative relation- 
ships can also be placed on normative 
scales, e.g., the above analyst might 
also say that one patient is more or less 
psychotic than another. Terminology 
is needed, then, to distinguish those 
individual differences which are due 
solely to variations on normative 
parameters from individual differences 
stemming from differing ipsative rela- 
tionships. Hence, it is proposed that 
the purely normative individual dif- 
ferences be termed “first order” norma- 
tive differences, while those individual 
differences stemming from ipsative, 
within-individual relationships might 


sense, 
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be termed “second order” normative 
differences. 

It seems, then, that there are two 
legitimate, but independent realms of 
inquiry for a general psychology of 


individual differences. These are (a) 


the establishment of those normative 
parameters of differences within the 
species that are not merely pheno- 
typical expressions of ipsative internal 
relationships, and (b) the establish- 
ment of ipsative parameters of internal 


organization. As already demon- 
strated empirically, the generalizations 
that hold at one level of analysis can 
differ from those that hold at the 
other level (Broverman, 1960, 1961; 
Mandler, 1959). 


NORMATIVE AND IPSATIVE 
MEASUREMENT IN STATISTICS 


Relatively little explicit attention is 
currently given to the distinction be- 
tween normative and ipsative measure- 
Consequently, numerous in- 
applications of 

For instance, 
prefer 


ment. 
consistencies in the 
these orientations exist. 
psychologists seem to 
approaches to research 
many, if not 
present day psychological 
lean heavily on_ ipsative 
The development 


most 
normative 
problems even though 
most, ol 
theories 
conceptualizations. 
of statistical procedures for separating 
these different aspects of behavioral 
data has also been uneven. Some 
statistical techniques discriminate be- 
tween normative and ipsative aspects 
of data while others do not. The fol- 
lowing two hypothetical examples, in- 
volving typical research problems and 
statistical procedures, illustrate (a) an 
appropriate separation of normative 
from ipsative aspects of data, and (b) 
a confounding of these two types of 
information. 

2An outstanding exception is Cattell’s 
(1944) paper concerning the use of norma- 
tive and ipsative scores in psychology. 
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The first example involves assessing 
the significance of the difference be- 
tween two means. An _ investigator 
wishes to determine whether motivat- 
ing instructions improve performances 
on a particular task. Hence, the ex- 
perimenter administers the task under 
a motivated and a nonmotivated condi- 
tion to the same group of subjects. 
Assuming sequence effects are con- 
trolled, the experimenter might pro- 
ceed to assess the significance of the 
difference between condition means via 
an analysis of variance technique. 
Since the experimental design employs 
repeated measurements, the total vari- 
ance would first be partitioned into 
and “within” individual 
sources of variation. The within 
individual variance would then be fur- 
ther divided into that associated with 
variation in conditions, and that which 
is not. After division by the appropri- 
ate degrees of freedom, the ratio of the 
two portions of within-individual vari- 
ance would constitute an F test. If 
this test attains the required level of 
significance the experimenter is able to 
say that “The mean performance under 
the motivated condition is significantly 
different from the mean performance 
under the nonmotivated condition.” 

This statistical procedure clearly 
separates the normative from the 
ipsative aspects of the data. The 
between-individual variance _ reflects 
normative interindividual differences 
in task ability, independent of condi- 
tions, and is appropriately discarded 
since this variance is not the point of 
inquiry. The experimenter is not con- 
cerned with ability differences between 
subjects. He wishes only to-know the 
effects of conditions on the subjects 
after such ability differences are con- 
trolled. The within-individual vari- 
ance, which reflects ipsative or intra- 
individual variation, on the other hand, 
reveals that the experimental conditions 


“between” 
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had significant effects upon perform- 
ance when these effects were measured 
against each subject’s own norm. This 
statistical procedure, then, sensitively 
discriminates between normative and 
ipsative aspects of data. 

In contrast, correlational analyses 
typically give little attention to the 
normative-ipsative distinctions in be- 
havior. Consequently, correlational 
procedures tend to confound inter- 
individual with intraindividual varia- 
tions in performance scores as illus- 
trated in the following example: 

The experimenter wishes to know 
the correlations between several tasks. 
Accordingly, he administers each task 
to each subject in his subject sample 
and correlates the various scores. 

Table 1 presents hypothetical stand- 
ardized scores for 12 subjects on four 
tasks. Table 2 shows that the Pearson 
product-moment intercorrelations of 
these tasks are positive, ranging from 
.69 to .93. This standard correlational 


procedure accepts these correlations as 
the relationships between test perform- 


ances, which, indeed, they are. How- 

ever, in the light of the previous dis- 

cussion, we know recognize that these 

relationships between test perform- 
TABLE 1 


HYPOTHETICAL PERFORMANCE 
STANDARDIZED SCORES 


DATA 


Mean 
standard 
score 


Subject | Test 1 Test 2 Test 3 Test 4 


67 
62 
58 
55 
53 
51 
49 
47 
45 
42 
38 
33 
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rABLE 2 


INTERCORRELATION OF TESTS 


| 
Test Test 2 Test 3 


Test 1 93 69 
Test 2 69 
Test 3 
rest 4 


Note 


Decimals have been omitted 


ances are a function of two sources of 
variation: (a@) normative differences 
between individuals in general ability 
levels, and (b) within-individual ipsa- 
tive variations in abilities on different 
tasks. The correlation coefficients in 
Table 2, then, are confounded result- 
ants of these two sources of variation. 
This is the same confounding that the 
experimenter took pains to avoid when 
assessing the difference between condi- 
tions means. The analogue of the 
mean difference technique in correla- 
tional procedures would be to eliminate 
normative differences in general ability 
level between individuals before exam- 
ining the intraindividual variations that 
occur as a function of differences in 
task requirements. The more precise 
experimental question would then be 
“What are the relationships between 
abilities on different tasks when dif- 
ferences in general ability levels be- 
tween subjects are held constant or 
statistically controlled?” <As_ will 
shown below, these relationships may 
be quite dissimilar to those obtained 
from the usual confounded method. 


be 


Our problem is to separate the vari- 
ance attributable to normative differ- 
ences between subjects from that due 
to within-individual variations in ability 
to perform different tasks. 

Normative variance between sub- 
jects occurs when some subjects tend 
to be generally above the group mean, 
while others tend to be below this refer- 
ence norm. However, if the task per- 
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TABLE 3 


HYPOTHETICAL PERFORMANCE DATA TRANS- 
FORMED INTO IPSATIVE SCORES 
(X; — X;) 


MUU 
nrNeuae 
aan 


wun 


formances of each subject are ex- 
pressed as ipsative scores, or deviations 
from each subject’s mean level of per- 
formance, then all subjects will have 
the same status with respect to the 
new reference norms, i.e., the sum of 
the deviations above each ipsative norm 
will equal the sum of the deviations 
below for each subject. Hence, norma- 
tive differences between subjects are, 
in effect, removed via ipsatization of 
performance scores. Correlation of the 
ipsative scores will then reveal the in- 
trapsychic relationships between abili- 
ties on different tasks, free from the 
influence of differences in general 
ability between subjects. 

The mean standard 
subject is shown in Table 1. 


score of each 
Table 3 
shows the ipsative scores (X; — X,) of 
each subject on each task. Table 4 
intercorrelations of these 
ipsative scores. It will be noticed that 
some of the correlations are now nega- 
tive, others positive. Thus, controlling 
normative interindividual differences in 
general ability levels considerably alters 
the correlational relationships between 
tasks, and, of course, any conclusions 
that might be drawn from these data. 


shows the 
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Instances where such controls have 
actually affected the results of studies 
have been reported by Mandler (1959) 
and Broverman (1960, 1961). 

It seems clear then, that correla- 
tional analysis, a classical research tool 
in systematic psychology, has suffered 
from a lack of sensitivity to the 
normative-ipsative distinctions in be- 
havior. Standard R factor analysis 
technique, which is an elaboration of 
the correlational method, suffers in the 
same manner, i.e., the factors extracted 
by these techniques are confounded ex- 
pressions of normative and ipsative 
variation. The confounding in R factor 
analysis can be overcome in the same 
manner as illustrated above, i.e., by 
ipsatizing the various performance 
scores before starting correlational and 
factoring procedures. As in the above 
correlational example, factors derived 
from ipsative scores will frequently 
have large negative loadings or correla- 
tions with tests. A discussion of the 
numerical mechanics underlying this 
phenomenon may be found in Brover- 
man (1961). That paper also pointed 
out that an ability factor defined by 
both positive and negative loadings can 
be viewed as a “choice point” in de- 
velopment since at some point in each 
individual’s development a decision 
must be made as to whether he is to 
specialize in one or the other of the two 
classes of behaviors defining the factor. 
Other investigators (Cattell & Bag- 
galey, 1956; Sweney, 1959) have inter- 
preted this specialization or directional 


TABLE 4 


INTERCORRELATIONS OF IPSATIVE SCORES 


Test 


Test1 | 58 
Test 2 


Test 2 Test 3 


—79 
= 


Test 4 


—79 
— 79 


Test 3 58 


Note.—Decimals have been omitted. 
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aspect of ipsative scores as an index 
of motive strength. 


RELATIONSHIP BETWEEN NORMATIVE 
PARAMETERS AND PARAMETERS DE 
RIVED FROM DouBLE-CENTERED 
IPSATIVE SCORES 


The relationship between dimensions 
derived from ipsative scores distributed 
about a normative base-line, and the 
normative dimension itself is worthy of 
particular note. Such ipsative dimen- 
sions are internal subdivisions or dif- 
ferentiations of the larger normative 
dimension. Again, Spearman’s (1927) 
theory of intelligence illustrates this 
point. Spearman argues that intel- 
ligence varies in units of mental energy. 
Normatively, then, the subjects differ 
from each other in terms of the amount 
of mental energy possessed by each. 
However, within intelligence, as men- 
tioned earlier, Spearman posits sub- 
eduction, reproduction, 
etc., the strength of which are 
measured in terms of mental energy 
and which vary ipsatively within each 
individual. These subfunctions, then, 
are internal differentiations within the 
larger normative trait of intelligence. 

This same part-whole relationship 
between ipsative and normative factors 
is obtained when the ipsative dimen- 
are empirically derived from 
double-centered ipsative scores via fac- 
tor analysis. For instance, a subject's 
average rank performance for a battery 
of cognitive tests might be interpreted, 
normatively, as an indication of his 
general level of intelligence. On the 
other hand, the factors emerging from 
an analysis of the ipsative scores dis- 
tributed about the average rank stand- 
ings of subjects would represent in- 
ternal differentiations of that normative 
trait. 

This part-whole relationship of ipsa- 
tive to normative factors may be con- 
trasted with the relationships between 


functions of 


also 


sions 
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factors produced by the usual norma- 
tive R factor analysis procedures. 
Factors produced by this latter method 
are discrete, and juxtaposed externally 
to each other. Even though normative 
R factors may be correlated with each 
other, one normative factor is not 
thought of as a subdivision of another, 
as in the double-centered ipsative ap- 
proach. 

As pointed out by Stephenson 
(1953), factor analysis is typically used 
in an exploratory way to determine the 
various interdependencies between 
psychological functions or behaviors. 
When statistical procedures are used 
in this exploratory manner, one tends 
to allow the model embodied in the 
statistical procedures to guide one’s 
conception of the organization of the 
behavior reflected in the data. Thus, 
the 30 years of research employing 
normative R factor analysis, with its 
model of externally juxtaposed factors, 
has left its mark on present conceptions 
of mental functioning, e.g., Burt’s 
The Factors of the Mind (1946), 
Thurstone’s Primary Mental Abilities 
(1938). While the part-whole model 
embodied in double-centered ipsative 
measurement is not new on the psy- 
chological scene—witness Spearman’s 
(1927) theory of intelligence—a clearer 
recognition of its properties may exert 
a correspondingly greater influence 
upon our conceptions of the organiza- 
tion of behavior. 


DETERMINATION OF APPROPRIATE 
NorMATIVE BASE-LINES 


A major problem in double-centered 
ipsative measurement is the identifica- 
tion of appropriate normative base- 
lines from which to develop ipsative 
So far, we have assumed that 
a single normative base-line may be 


scores. 


used for a given matrix of data. This 
assumption is implicit in the analysis 
of variance repeated measurements de- 
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However, it is possible that 
than one normative variable, 
more than one base-line, is 
actually embedded in a given matrix 
of data. In this case, certain behaviors 
ought to be ipsatively measured against 
one base-line, others against another. 
Hence, a method for objectively estab- 
lishing the presence of normative vari- 
ables is needed. This problem may be 
resolved via the following considera- 
tions: 

Any single performance score may 
be thought of as a resultant of norma- 
tive and ipsative sources of variation. 
Consider, now, a situation where sev- 
eral tests all reflect a single normative 
trait, but some of these tests also reflect 
one ipsative dimension, other of these 
tests reflect a second ipsative dimen- 
sion, still others reflect a third ipsative 
dimension, etc. Each test score of a 
given subject, then, consists of a con 
stant representing his status on the 
normative dimension and a_ variable 
representing the standing of the task 
on a particular ipsative dimension. 
Since this normative constant varies 
between subjects, it tends to produce 
positive relationships between tests. 
The ipsative components of the various 
performance scores, on the other hand, 
will either correlate positively when 
they reflect the same ipsative dimen- 
sion, or not correlate when they reflect 
different ipsative Still 
other ipsative components may be in- 
versely related and work towards nega- 
tive relationships. Thus the normative 
components tending to produce posi- 
tive relationships between tests will 
summate with those positive ipsative 
covariations, but will be weakened or 
canceled out by those independent and 
negatively varying ipsative components. 
The first order (simple structure) 
factors extracted from performance 
scores, then, will be those tests whose 
ipsative components have positive rela- 


sign. 
more 
hence, 


dimensions. 
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tionships augmented by their positively 
covarying normative components. 

Since each of these factors have the 
same normative components, only the 
variation of the ipsative components 
can be acting to produce the different 
factors. Also, since each factor has 
the same normative component, these 
factors should correlate with each 
other. Accordingly, the factor inter- 
correlations ought to produce a second- 
order factor reflecting the presence of 
the normative trait. Of course, if the 
data actually contained more than one 
normative trait, then more than one 
second-order factor would be obtained. 
The number of second-order factors 
obtained from a factorial solution of 
performance data, then, indicates the 
number of normative dimensions pres- 
ent in the data. In turn, each norma- 
tive dimension calls for its own base- 
line from which to assess ipsative 
variations of the tests included in the 
second-order factor. Thus, the mean 
group standing of all tests included in 
each second-order factor would be ob- 
tained for each subject. Deviations of 
individual tests from these means 
would then reflect ipsative variations 
within a given normative dimension. 
Other methods of obtaining appropri- 
ate -base-lines for ipsative oriented 
research problems may be found in 
3roverman (1960). 


CONCLUSION 


It may be well to conclude by at- 
tempting to see the issues discussed in 
this paper in a broader perspective. It 


seems that the measurement of be- 
havior is more complicated than pre- 
viously suspected. One wonders why 
this should be so; why one can not 
proceed to a more straightforward 
measurement of behavior. The reasons 
seem at least three-fold: 

First, human behavior is so bewil- 
deringly varied and multifaceted that 
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few psychologists are willing to stay at 
the level of raw, uncategorized, ob- 
Such an en- 
infinity 


servations of behavior. 
terprise would produce an 
of parameters, variables, and _ scales. 
Rather, one typically seeks to reduce 
a large number of different behaviors 
to a smaller number of common under- 
lying parameters. The pertinent vari- 
ables of behavior, then, are not acces- 
sible for direct measurement. One 
must fractionate, peel off, and discard 
the surface aspects of different be- 
haviors in order to uncover and obtain 
an estimate of the underlying variables. 
Thus, in measuring behavior, pheno- 
typical surface appearances are dis- 
carded in favor of inferred genotypical 
variables. 

A second aspect of human behavior 
which contributes to the complexity pf 
its measurement is that any given be- 
havior can simultaneously represent 
more than one underlying variable. A 
given behavioral act may be a com- 
posite expression of many different 
variables. Thus, one must separate 
not only phenotype from genotype, but 
also genotype from genotype. 

Finally, we must recognize that all 
human behavior potentially participates 
in two different realms of human varia- 
tion, i.e., species and individual. Each 
behavioral act is, potentially at least, a 
simultaneous expression of the human 
species’ as well as of the individual’s 
repertoire of variation. It is this dual 
aspect of behavior which produces the 
particular normative-ipsative model 
elaborated in this paper. Failure to 
represent this fact in measurement and 
statistical operations must lead to a 
confounding of these two different 
aspects of human existence and varia- 
tion. If one accepts the proposition 
that the laws or regularities which 
characterize variations in behavior be- 
tween individuals are not the same as 
those which characterize variations in 
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behavior within individuals, then such 
confounding is to be studiously avoided. 
Hence, an orderly inquiry into the 
nature of human behavior requires a 
stringent separation of the between- 
individual species parameters from the 
ipsative intraindividual organizational 
aspects of behavior in order that the 
separate relational networks of each 
realm may be established. While this 


may involve considerable more work 
than previous methods, the results may 
be proportionately more rewarding. 
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FRUSTRATIVE NONREWARD IN PARTIAL 
REINFORCEMENT AND DISCRIMINATION LEARNING: 


SOME RECENT 


HISTORY AND A THEORETICAL 


EXTENSION ! 


ABRAM AMSEL 


University of Toronto 


The overall objective of this paper 
is to examine the current status of a 
of frustrative nonreward in 

theory. The examination 
takes two forms: a general discussion 
and survey of recent work on 
tivating and inhibiting properties of 
nonreward, followed by a theoretical 
frustrative-nonreward 
prediscrimination 


concept 
behavior 


mo- 


extension of 
theory to certain 
phenomena. 

In the first part of the paper a brief 
historical account will be presented of 
recent changes toward an “active” 
conceptualization of nonreward, par- 
ticularly in neo-Hullian theory, the cur- 
rent status of such an active nonreward 


concept in explanations of partial re- 


will be set in a 
partial 


some 


inforcement effects 
broader framework of 
forcement theories, and 
data will be presented in support of the 
idea that discrimination learning in- 
volves frustrative-nonreward effects. 
The second more theoretical portion 
of this paper from an earlier 
frustrative conceptualization of non- 
reward (Amsel, 1958). It represents 
an attempt to extend frustrative-non- 
reward theory to situations in which 
discrimination learning is preceded by 
various amounts and kinds of “prior 
These prior experiences 


rein- 
recent 


stems 


experience.” 


1A shorter version of this paper was pre- 
sented to the Psychology Colloquium of the 
University of Illinois in May 1961. The 
preparation of this paper and much of the 
experimental which it refers 
were supported by grants and 
G-13895) from the National Science Founda- 
tion. 


research to 


(G-5527 
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will be termed “prediscrimination 
treatments” and will involve training 
to approach in the presence of one or 
both (separately) of the eventual 
discriminada, under conditions of 
partial or of continuous reinforcement. 


History 


Hull’s adoption of what he termed 
the Mowrer-Miller hypothesis as a 
basis for a two-factor theory of inhibi- 
tion (Hull, 1943) was, at the same 
time, the acceptance of a “passive” con- 
ception of nonreward and rejection 
of a more active conception which 
he and others had held. Certainly, 
Pavlov’s nonreinforcement, producing 
internal inhibition, was an active con- 
ception. Spence’s (1936, 1937) theory 
of discrimination learning, clearly in 
the Pavlovian tradition, also assigned 
special inhibitory properties to non- 
reinforcement. In some of his more 
recent systematic writing, Spence 
(1956) points out that his position on 
the Mowrer-Miller hypothesis was 
never in agreement with Hull’s. 

Emotional-motivational effects of 
nonreward, as they relate to learning, 
were described in an article by Miller 
and Stevenson (1956), which attrib- 
uted the agitated behavior of rats dur- 
ing extinction to effects of nonrein- 
forcement carried over from one trial 
to the next. This is an example of 
an active motivational conception of 
nonreward. A similar emotional-moti- 
vational interpretation of nonreward 
can be found in Skinner (1938). 

In Hull’s 1943 theory, which domi- 
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nated the psychology of learning in the 
forties, nonreinforcement was an event 
without direct inhibitory or motiva- 
tional effect as it was also in the 
early mathematical-model treatments 
of learning (Bush & Mosteller, 1951; 
Estes, 1950). By about 1950, how- 
ever, expressions of dissatisfaction 
with the passive theoretical status of 
nonreward were emerging. These ex- 
pressions were in theoretically oriented 
experiments and in theoretical papers. 
Most of the early defections from 1943 
Hull, in this connection, were by per- 
sons close to the Hullian point of view. 
There are suggestions that Hull, him- 
self, was getting ready to change his 
position on this shortly before his 
death (Hull, 1952, Ch. 5). 

Rohrer’s (1949) treatment of ex- 
tinction in terms of a “frustration 
drive” was one of the first expressions 
of a desire to give nonreinforcement a 
more active role in inhibition.2? How- 
ever, while Rohrer’s reference to frus- 


tration was specifically related to the 
operation of nonreinforcement, it pro- 
vided no particular mechanism for 


frustrative inhibition. At about the 
same time, the well-known experi- 
ments of Virginia Sheffield (1949, 
1950) appeared, relating partial re- 
inforcement acquisition and extinction 
effects to the spacing of trials and 
containing suggestions of both frustra- 
tion drive and frustrative inhibition. 
These articles and another by Stanley 
(1952), along with a late experimental 
paper by Hull and others (Hull, 
Livingston, Rouse, & Barker, 1951) 


2It is questionable that “inhibition” is the 
appropriate and correct term to use in re- 
gard to response decrement related to frus- 
trative nonreward. Better terms would be 
“interference” or “competition,” since frus- 
trative inhibition cannot be regarded as di- 
rectly affecting the excitatory tendency out 
of which it grows. It is rather a new ex- 
citatory tendency which competes with the 
older one. 
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and Hull’s (1952) last book, all con- 
tained at least the germ of a change 
toward a more active conception of 
nonreinforcement. An experimental 
paper by Denny and Dunham (1951) 
offered inhibition based on nonrein- 
forcement (frustration) as an alterna- 
tive to reactive inhibition to account 
for differential nonreinforcement ef- 
fects in a T maze. 

Brown and Farber (1951) outlined 
a theory in which nonreward was one 
of several antecedents to frustration. 
In this paper, the major emphasis was 
on the definition of frustration in terms 
of competing response tendencies and 
the authors were concerned only with 
the motivational (drive) properties of 
frustration, and not with inhibitory 
effects. In the same year, a paper by 
Amsel (1951) made the point that 
fractional anticipatory frustration (rg), 
which is the classically conditioned 
form of the frustrative reaction to non- 
reward, provides a mechanism for con- 
ceptualizing the active properties of 
nonreinforcement and should be re- 
garded as a determiner of inhibitory 
effects. And, at about the same time, 
Seward (1951) published a theoretical 
paper which mentioned, without elabo- 
ration, the possible functional proper- 
ties of an anticipatory frustration 
factor. 

The remainder of the decade, 1950 
to 1960, has seen a number of addi- 
tional experimental and theoretical at- 
tacks on the problem of nonreward. 
On the one hand there are experiments 
and theoretical treatments by investi- 
gators in the Hullian tradition of be- 
havior theory which have sought to 
extend and revise Hull’s theory in re- 
gard to the status of nonreward. The 
other hand holds a number of published 
studies representing a variety of theo- 
retical and nontheoretical interests 
which have contributed information 
about frustrative nonreward. 
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Much of this work has been re- 
viewed earlier (Amsel, 1958; Lawson 
& Marx, 1958) ; but some of it is very 
recent. The experiments fall into 
three categories: (a) demonstrations 
of the motivating effects of frustrative 
nonreward, i.e., the carried-over, en- 
hancing effect of nonreward on the 
vigor of immediately following be- 
havior. This has been termed the 
Frustration Effect (FE). Several 
studies, by far the greatest number, 
have been directed not only at demon- 
strating the effect but also at examin- 
ing the variables determining its occur- 
rence and strength (Amsel, Ernhart, & 
Galbrecht, 1961; Amsel & Hancock, 
1957; Amsel & Penick, 1962; Amsel 
& Roussel, 1952; Bower & Stocks, 
1960; Longstreth, 1960; Marzocco, 
1951; Penney, 1960; Roussel, 1952; 
Seward, Pereboom, Butler, & Jones, 
1957; Wagner, 1959). (b) The sec- 
ond category consists of experiments 


which are mainly concerned with the 
frustration drive stimulus, but which 
also touch on frustration reduction as 
reinforcement (Amsel & Prouty, 1959; 
Amsel & Ward, 1954; Tyler, Marx, 


& Collier, 1959). (c) And finally, 
there are a few studies which deal with 
the inhibitory properties of frustrative 
nonreward. The inhibitory mecha- 
nism is fractional anticipatory frustra- 
tion (rr), a classically conditioned 
form of the goal response to frustrative 
nonreward. The mechanism and its 
theoretical properties have been identi- 
fied in detail (Amsel, 1958; Spence, 
1960), and experiments testing impli- 
cations of such a mechanism are avail- 
able (Bower, 1961; Goodrich, 1959; 
Haggard, 1959; Wagner, 196la, 
1961b). Of particular importance in 
this connection are a series of experi- 
ments by Wagner (1961b) which go 
far toward establishing the proposition 
that frustrative nonreward has many 
of the properties of punishment, and 
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that rp operates in many respects like 
fear. These experiments would support 
such statements as (a) the greater the 
strength of frustration the faster the 
extinction (avoidance) of a nonre- 
warded response; (>) cues paired with 
frustrative nonreward acquire motiva- 
tional properties ; and (c) the cessation 
of cues previously paired with frustra- 
tive nonreward serves to reinforce a 
new response. 

There are several other experiments 
on nonreward (or reward prevention) 
as a variable in learning which provide 
support for nonreward as a factor in 
inhibition. For example, studies by 
Lambert and Solomon (1952) and by 
Adelman and Rosenbaum (1954) 
related extinction to the blocking of 
instrumental behavior at various dis- 
tances from the goal. An experiment 
by Adelman and Maatsch (1955) 
found that resistance to extinction de- 
pends to some extent on what the sub- 
ject is allowed to do when it finds no 
reward in the goal box, ie., is frus- 
trated. The suggestion is that resist- 
ance to extinction is high when the 
subject makes a response to frustration 
which removes it from the frustrating 
situation rather than one which keeps 
it in the apparatus. Ferster (1957, 
1958) found that stimuli signaling 
“time out” from positive reinforce- 
ment acquire aversive properties. He 
also showed that the withdrawal of a 
positive conditioned reinforcer had the 
functional properties of punishment, 
suppressing the rate of responding 
much like the presence of a negative 
reinforcer. These experiments point 
clearly to the ‘inhibitory (aversive) 
properties of nonreward. 

Some recent experiments, which 
show decreasing resistance to extinc- 
tion with increasing numbers and 
magnitudes of reinforcement, must be 
regarded as supporting a conception 
of nonreward as an active factor. For 
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example, Hulse (1958) and Armus 
(1959) found that larger magnitudes 
of reinforcement in acquisition are 
followed by faster extinction. Very 
recent reports by Reynolds and Siegel 
(1961), by Pavlik (1961), and by 
Wagner (196la) were to the same ef- 
fect. North and Stimmel (1960) have 
shown that rats reinforced 90 or 135 
times extinguish faster than those re- 
inforced 45 times. Similar results 
have been reported by Reynolds, 
Richter, and Carlock (1960). They 
found faster extinction following .5- 
than following .l-gram reward, and 
faster extinction after 66 acquisition 
trials than after 6, 12, or 30 trials. 
This kind of result is quite in accord 
with results from studies of overlearn- 
ing and discrimination reversal, which 
we will look at later. Together, they 
are as incompatible with early Hullian 
theory, or any other theory requiring 
monotonicity and positive relationships 
in these variables (e.g., the various 
mathematical approaches), as were the 
early partial reinforcement findings. 
We will come back to some of these 


experiments later, and my discussion 
of partial reinforcement effects will 
show why a theory which includes a 
frustrative nonreward hypothesis must 
predict such results. 

There are by now several experi- 
ments which can be taken to indicate 


that nonreinforcement is im- 
portant than reinforcement in discrimi- 
nation. Chronologically, these are 
studies by Fitzwater (1952), Grove 
and Eninger (1952), Eninger (1953), 
Shoemaker (1953), Cantor and Spiker 
(1954), Grice and Goldman (1955), 
Birch (1955), and Solomon (1956), 
to mention some of the earlier ones. 
A very recent study which makes this 
same point has been reported by 
Lachman (1961). This is not the 
place for a detailed description of these 
studies; however, the burden of evi- 


more 
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dence is that avoidance of nonreward 
is a more powerful factor in discrimi- 
nation than approach to reward; and 
this is, of course, very relevant to a 
point of view which makes nonreward 
a determiner of inhibition. Perhaps 
Harlow and Hicks (1957) were re- 
acting not only to their own experi- 
mental findings but also to the weight 
of this kind of evidence when they 
argued for a “uniprocess” rather than 
a “duoprocess” theory of discrimina- 
tion learning, the single process being 
inhibition. The weakness of their 
argument is that the negative process 
(ry, in my terms) may well depend on 
the prior existence of a positive process 
(see Amsel, 1958; Spence, 1936, 
1937), and one is back to two proc- 
esses again. 

Other examples of a return to an 
active conception of nonreward outside 
of the Hull-Spence camp can be found 
in the recent work of Mowrer and 
Estes. Mowrer, in his Learning 
Theory and Behavior, makes frequent 
use of the notion of nongoal events 
arousing “frustration” or “anger,” in 
addition to “disappointment” and ex- 
presses agreement with the position I 
favor in regard to frustrative goal 
events (Mowrer, 1960, p. 409). 
Estes (1959) has recently acknowl- 
edged that statistical learning theory 
must cope with the problem of an 
active negative process based on non- 
reward in these words: 


... 1 have found that in the naive animal 
an unreinforced trial produces no apparent 
change in response probabilities, but that 
after a series of reinforced runs to a given 
side of a T maze, the introduction of non- 
reinforcement yields a decrement in proba- 
bility of response to that side. . . . Appar- 
ently under partial reinforcement schedules, 
the effect of nonreinforcement varies be- 
tween these extremes...” (p. 417). 


From this brief sketch of the recent 
history of the concept of nonreward, 
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we move to a description of the role 
of active conceptions of nonreward 
in interpretations of noncontinuous 
(partial) reinforcement effects. 


NONREWARD IN INTERPRETATIONS OF 
PARTIAL REINFORCEMENT 


In his 1958 chapter on learning, 
Lawrence differentiates between “in- 
tertrial” and “intratrial” interpreta- 
tions of the Partial Reinforcement 
Effect (PRE)—the finding that partial 
or intermittent reinforcement in learn- 
ing leads to greater resistance to 
extinction than does continuous rein- 
forcement. Intertrial explanations at- 
tribute the PRE in one way or another 
to immediately carried-over traces of 
stimulation from one trial to the next 
in acquisition. According to such inter- 
pretations, it is on the basis of such 
traces that acquisition is more clearly 
discriminated from extinction by con- 
tinuously than by partially reinforced 


subjects. Intratrial explanations ascribe 


the PRE to some kind of learned 
mechanism developing on each trial in 
acquisition and having its effect, as- 
sociatively, later in extinction. There 
have been two classes of intertrial ex- 
planations. In both cases the classes 
can be termed Cognitive-Expectancy 
(C-E) interpretations, and Stimulus- 
Response (S-R) interpretations. 
Generally speaking, the intertrial 
interpretations were an earlier develop- 
ment than the intratrial interpretations. 
Early C-E interpretations were the 
“common sense” expectancy hypothe- 
sis which emerged out of the early 
important experiments of Humphreys 
(e.g., 1939, 1940) and the discrimina- 
tion hypothesis developed and tested 
by Bitterman and his associates (e.g., 
Bitterman, Fedderson, & Tyler, 1953). 
On the S-R side, the major intertrial 
explanation (apart from the “response 
unit” interpretation which applies to 
chained or highly massed responses 
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but not to discrete trial learning) was 
the Hull-Sheffield hypothesis tested by 
Sheffield (1949). Weinstock’s (1954) 
demonstration that partial reinforce- 
ment effects were not eliminated even 
when trials were separated by 24 
hours, and an experiment by Wilson, 
Weiss, and Amsel (1955) disconfirm- 
ing the earlier results of Sheffield 
helped to set the stage for, and indeed 
to provide, intratrial explanations of, 
the PRE. Much later, in fact very re- 
cently, there has been strong contra- 
indication for a discrimination hy- 
pothesis in the work of Jenkins 
(1961b) and of Theios 1962). 
Jenkins, using pigeons in a Skinner- 
type situation, and Theios, using rats 
in a runway, have found that the PRE 
is sustained when a large number of 
continuous reinforcements are inter- 
polated between discrete trial partial 
reinforcement and extinction. This 
means that a large number of continu- 
ous reinforcements immediately be- 
fore extinction and following partial 
reinforcement does not neutralize the 
partial reinforcement effect. Since the 
basis for discriminating acquisition 
from extinction is now the same for 
partially as for continuously reinforced 
subjects, and the PRE persists, an 
interpretation is required, as Theios 
indicates, in terms of “relatively per- 
manent effects of nonreinforcement 
which can be sustained through blocks 
of continuous reinforcement.” 
Intratrial explanations of the PRE 
concern themselves in one way or an- 
other with processes following non- 
reward. Until recently, the intratrial 
explanations were the exclusive prop- 
erty of S-R_ psychologists. Wein- 
stock’s (1954) contiguity-interference 
hypothesis was perhaps the earliest of 
these, but was followed in short order 
by a variety of neo-Hullian hypothe- 
ses. All of these are what I would 
term conditioning-expectancy hypothe- 
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ses as opposed to cognitive-expectancy 
hypotheses of the PRE. All depend 
upon constructs of the anticipatory 
goal response type, representing re- 
sponses which develop in strength 
within trials. There are three neo- 
Hullian interpretations of the .2E, all 
of which were first proposed to account 
for particular experimental findings. 
The first emerged out of a study by 
Wilson, Weiss, and Amsel (1955) and 
was subsequently elaborated and tested 
in some detail (Amsel, 1958). It at- 
tributes the PRE to the evocation of 
frustration by nonreward in partial 
reinforcement, to the conditioning of 
anticipatory frustration (rp — sp) to 
cues in the instrumental sequence, and 
to the association of sp with approach 
responses in acquisition. Logan’s in- 
terpretation of the PRE (Logan, 
Beier, & Kincaid, 1956) is that it 
depends on the extinction of ff}; 
to “postreinforcement-time cues.” 


Kendler and his associates (Kendler, 


Pliskoff, D’Amato, & Katz, 1957) 
have offered an interpretation similar 
to the one I have favored, except they 
choose a neutral designation of non- 
reward effects in acquisition and refer 
to anticipatory nonreward and _ not 
to anticipatory frustration. Spence 
(1960) has recently come out in favor 
of the same kind of interpretation and 
has employed the ry construct to ac- 
count for acquisition phenomena in 
partial reinforcement. Consequently, 
it seems safe to say that a prominent 
interpretation of the PRE in _ neo- 
Hullian writings is a conditioning-ex- 
pectancy position with identifies frac- 
tional anticipatory frustration as well 
as fractional anticipatory reward. 
Recently, Festinger (1961) has 
called attention to the importance of 
“insufficient rewards” in a cognitive- 
expectancy (cognitive dissonance) in- 
terpretation of the PRE. Cognitive 
dissonance (Festinger, 1957) is a hy- 
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pothesis which recognizes the active 
properties of nonreward. In its use of 
delay of reward procedures to enhance 
attractiveness of a goal in a delay of 
reward situation, Festinger’s treatment 
of insufficient rewards is very similar 
in conceptualization to some earlier 
work of Olds (1953, 1956). On the 
basis of experiments with children, 
Olds concluded that the reward value 
of a secondary rewarding stimulus 
(S") could be increased—and hence the 
strength of approach responses leading 
to S™ could be increased—by delaying 
the presentation of S* after overlearn- 
ing. He showed that tokens which 
could ultimately be exchanged for 
more primary reward increased in re- 
ward value when their presentation 
was delayed, if these delay trials fol- 
lowed a number of trials in which their 
presentation was immediate. Olds 
suggested that such a procedure con- 
stitutes “practice at wanting” and that 
such practice at wanting increases the 
value of that which is wanted (S'‘). 
Festinger would seem to be taking the 
same position in his animal research. 
In one of several experiments dealing 
with dissonance in rats, he employs a 
double runway in which a start box is 
separated from a mid box by an alley, 
and the mid box is separated from an 
end box by another alley. Subjects 
are delayed in the mid box in the 
manner of Holder, Marx, Holder, and 
Collier (1957) and Wagner (1959), 
and the finding is that the subjects de- 
layed in the mid box, before being al- 
lowed to run to the end box and find 
food, will continue to run to the mid 
box longer on test trials than the sub- 
jects always fed in the mid box as well 
as in the end box. Festinger’s (1961) 
interpretation is that the delayed sub- 
jects develop dissonance as a result of 
the delay, and that the “dissonance 
tends to be reduced by developing some 
extra preference about something in 





312 ABRAM 
the situation. The existence of this 
extra preference leads to the stronger 
inclination to continue running during 
extinction trials” (p. 9). 

The position of Festinger, like the 
earlier one of Olds, is clearly that non- 
reward (or delay of reward) is an 
active factor determining an increased 
want or attraction. However, there 
seems to be an important difference in 
what increases in attractiveness be- 
tween Festinger’s earlier treatments 
of dissonance in humans (Festinger, 
1957) and his application of the con- 
cept to the explanation of the PRE in 
rats (Festinger, 1961). While speci- 
ficity is not always a characteristic of 
cognitive theory, it 


does seem as 


though the 1957 treatment (and even 
parts of the 1961 paper) did specify 
that the violation of a cognitive expec- 
tation produces dissonance which is 
resolved by upgrading the value of the 
previous goal object. 


However, in the 
treatment of “insufficient rewards” in 
terms of the prototype experiment with 
start box, mid box, and end box, dis- 
sonance is said to develop in the mid 
box, where the subject never has found 
food; and dissonance is reduced “by 
developing a preference for something 
about the place where the dissonance 
was introduced... by developing 
some liking for the place where they 
were not rewarded...” (p. 10). 
This latter position is used to explain, 
in terms of dissonance, why the sub- 
jects never fed in the mid box run 
longer to the mid box on nonreward 
test trials than the subjects always fed 
there before. The same thinking is 
then carried over to explanation of the 
PRE. In terms of the S-R analysis, 
the subjects have been trained to ap- 
proach the mid box of Festinger’s 
situation despite anticipatory frustra- 
tion (sp) cues because, in the period 
prior to testing, they were always 
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finally fed in the end box. In C-E 
terms the subjects developed an extra 
preference for the nonreward situation 
—the mid box in which they had never 
been rewarded. This would leave to 
be answered such questions as: Why 
does successive nonreward result in 
extinction? And, how do the subjects 
learn to discriminate cues leading to 
reward from leading to non- 
reward? 

The S-R treatment of nonreward ef- 
fects in partial reinforcement (Amsel, 
1958; Kendler, Pliskoff, D’Amato, & 
Katz, 1957; Spence, 1960; Wilson, 
Weiss, & Amsel, 1955) does not sug- 
gest that nonreward (or delayed re- 
ward) increases “wanting” in Olds’ 
terms, or “preference about something 
in the situation” in Festinger’s. This 
type of neo-Hullian approach is a de- 
scription of a mechanism operating 
through anticipatory nonreward which 
can lead to either rapid extinction of 
behavior or to increased resistance to 
extinction, and which can either facili- 
tate discrimination learning or retard 
it. I will claim that neither increased 
“wanting” nor “preference’’—in fact, 
no “common sense” cognitive inter- 
pretation—can account for all of the 
effects which nonreward seems to 
have. 

The difference between the two posi- 
tions, in the cognitive language of Olds 
and Festinger, is as follows: While 
the C-E position has nonreward 
increasing attractiveness of the non- 
reward situation, the S-R_ position 
implies a decrease in attractiveness. 
The partial reinforcement effect is, 
according to Festinger, due to in- 
creased attractiveness. According to 
my position it is the result of training 
to persist in behavior despite decreased 
attractiveness. In some other respects 
the positions are similar: both make 
nonreward the important determinant 


cues 
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of the effect, and both stress the im- 
portance of developing (intratrial) as- 
sociative tendencies during acquisition 
as determinants of ultimate resistance 
to extinction. Neither necessitates an 
assumption that the subject discrimi- 
nates acquisition from extinction (Bit- 
terman, Fedderson, & Tyler, 1953). 
The details of the S-R_ position 
which I favor have been presented 
elsewhere (Amsel, 1958) in the form 
of a sequence of hypotheses identified 
with four stages of practice in dis- 
crete trial partial reinforcement or in 
discrimination learning (the _ first 
three stages are essentially the same 
for both). A fifth hypothesis which 
applies to discrimination learning has 
been added, and this will be important 
for some data which follow. The es- 
sence of the position is outlined below 
in relation to an apparatus we have 
been using. It was designed to dif- 
ferentiate the immediate motivating 
(energizing) effects of frustration ex- 
emplified by the so-called Frustration 
Effect (FE) from fractional anticipa- 
tory frustration (rp), which represents 
the inhibiting feature of frustration. 
The apparatus is essentially, two run- 
ways in series, either in a straight line 
(e.g., Amsel & Roussel, 1952) or in 
the form of an L (eg., Amsel & 
Hancock, 1957). The apparatus has 
a start (SB), a first runway 
(Runway 1), a first goal box (G1), a 
second runway (Runway 2), and a 
(G2) in 
Reward and nonreward 


box 


second goal box successive 
arrangement. 


are manipulated in Gl, and the ap- 


paratus permits a separation of the 


frustrated response and changes in 
running speed indicating rp (measured 
in Runway 1), from the frustration 
motivated response and changes indi- 
FE Run- 


cating the (measured in 


way 2). 
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The sequence of hypotheses in terms 
of five stages of practice is as follows: 

1. In Stage 1, rz —spz* is developing 
in Runway 1 with early rewards, and 
nonreward (in Gl) has no particular 
effect, as measured by the FE in 
Runway 2. 

2. After the development of rz — sp 
for the Runway 1 response, nonre- 
wards elicit frustration, measured by 
the FE in Runway 2. This is Stage 2. 

3. When nonrewards elicit frustra- 
tion, the cues in Runway 1 previously 
evoking rp now also begin to evoke rp. 
In Stage 3 these antedating goal re- 
sponse tendencies are temporarily in 
competition. 

4. Since rg and rp cannot be elicited 
separately by differential cues in par- 
tial reinforcement, as they can in the 
latter stages of discrimination learn- 
ing, and since the temporary conflict 
in partial reward training is resolved 
in favor of running to the intermit- 
tently rewarding goal box, sr becomes 
associated with the instrumental ap- 
proach response in Stage 4 of par- 
tial reinforcement training, providing 
the mechanism for the partial rein- 
forcement effect. When extinction is 
carried out, partially reinforced sub- 
jects have been trained to respond (ap- 
proach in the presence of antedating, 


81 employ the term, rr — sr, in the same 
manner as it is used in an earlier develop- 
ment, and for the same reason: ‘ 
is a general term covering all types of ante- 
dating conditioned response. Separately, 
these might be designated rr — sr, fractional 
anticipatory reward; rp — sp, fractional an- 
ticipatory punishment, usually termed fear or 
anxiety; and rr — sr, fractional anticipatory 
frustration” (Amsel, 1958, p. 102). The 
meaning of rr — Sr is, however, not different 
from ra—se as Spence (1960) uses it in a 
similar connection. All that is intended here 
is a clarification of terminology in light of 
the fact that goal events may be positive 
(such as to elicit approach) or negative 
(such as to elicit avoidance). 
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frustration produced stimuli), whereas 
consistently reinforced subjects have 
not. In discrimination situations there 
is a basis for the differential elicitation 
of approach and avoidance by sz and 
Sp respectively in Stage 4. 

5. Following discrimination, when 
ty consistently precedes nonreward, 
the FE following nonreward should 
grow weaker, since nonreward is de- 
fined as frustrating only in the presence 
of TR. 

I have been suggesting that in a very 
important area of the analysis of be- 
havior, the explanation of persistence 
of behavior in the face of nonrewards 
attributable to partial reinforcement, 
the role of nonreward as an active 
process has emerged in both cognitive- 
expectancy and in S-R conditioning 
interpretations. It would certainly 


seem to me that this is an area where 
the clearer, better worked out S-R lan- 
guage and conceptualization might pro- 


vide a handle for a more workable 
expectancy theory. 

There is, in addition, the matter of 
a basic difference between the cognitive 
and S-R interpretations of the PRE: 
the former says nonreward increases 
attractiveness; the latter says non- 
reward decreases attractiveness. Both 
positions can account for the PRE; both 
can account for faster extinction fol- 
lowing large rewards than following 
smaller rewards; both can account for 
faster extinction following many re- 
wards than following fewer rewards. 
However, a cognitive theory such as 
Festinger proposes would have dif- 
ficulty with a finding of Hulse (1958) 
which has recently been confirmed by 
Wagner (196la). In Hulse’s words: 
Bs large as compared with small 
rewards produced greater resistance to 
extinction if partial reinforcement were 
used, but Jess resistance to extinction 
if continuous reinforcement were used 
...” (p. 56). In the cognitive lan- 
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guage of Festinger this would have to 
mean that the subjects develop “some 
extra preference about something in 
the situation” (greater resistance to 
extinction) when rewards are small 
and continuous; and that they do not 
develop the extra preference, in fact 
quite the opposite, when rewards are 
small and partial. The result of Hulse 
and of Wagner is clearly in line with 
the S-R interpretation in terms of 
anticipatory responses: Extinction is 
faster following continuous rewards of 
large than of small magnitudes because 
of the greater FE in extinction and 
subsequent faster development of rr. 
However, in partial reinforcement 
there is opportunity to elicit FE and 
to build rp during acquisition. Under 
these circumstances, the larger magni- 
tude of reward permits the mecha- 
nism for the PRE—the connection of 
Sp to continued approach—to develop 
more strongly. Consequently extinc- 
tion should be slower following partial 
rewards of large magnitude than fol- 
lowing partial rewards of small magni- 
tude. 

I am claiming that the S-R condi- 
tioning analysis of nonreward effects 
has much to recommend it. The ques- 
tion is, How much? To break the 
question down: Does inclusion of the 
concepts rg and rp in the analysis of 
partial reinforcement and discrimina- 
tion learning provide us with explana- 
tory power which a more parsimonious 
approach-avoidance analysis (e.g., 
D’Amato & Jagoda, 1960; Nissen, 
1950) does not? Does this type of 
analysis of nonreward effects provide 
predictive power which a cognitive 
treatment of nonreward (e.g. Fes- 
tinger, 1961) does not? 

My attempts to convince the reader 
that the answer to these last two ques- 
tions is “yes” will take two forms: 
(a) Some data will be presented which 
bear on predictions already made on 
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the basis of this type of analysis. I will 
claim that data such as these are not 
readily deducible from either a less 
specific cognitive language or from a 
more parsimonious approach-avoidance 
language. (b) I will offer an exten- 
sion of frustrative nonreward theory 
designed to generate predictions about 
the course of discrimination learning 
following various prediscrimination 
experiences with partial and continu- 
ous reward. 

Before proceeding to this extension 
of the theory of frustrative nonreward, 
a brief comment on the relation of the 
present position on nonreward to other 
theories of frustration. As I have 
indicated before (Amsel, 1958), “. . . 
we have been interested in certain 
active properties of nonreward follow- 
ing reward and no more than this is 
meant by the term frustration as we 
use it” (p. 103). Nevertheless, the 


mechanism which has been outlined to 
explain the PRE, particularly the con- 


ditioning of rp and the connection of 
sp to continued approach as a result 
of intermittent reward, is certainly a 
candidate for the learning theory 
counterpart of “frustration tolerance” 
as employed in social psychology and 
psychopathology. The treatment of 
frustration from the viewpoint of theo- 
ries of personality is beyond the scope 
of this paper; and there is no intent 
here to grapple with the multiple 
meanings the concept has in those 
contexts. However, it would seem 
possible to recast portions of the frus- 
tration theory of Rosenzweig (e.g. 
1944) which deal with nonreward 
(failure or delay of “gratification’’) 
into the present terminology; and the 
treatment of frustration by Rotter 
(1954), as a discrepancy between re- 
ward expectancy and its occurrence, 
seems compatible with the mechanisms 
hypothesized here and with the theo- 
retical extension I will propose. 
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NONREWARD EFFECTS IN 
DISCRIMINATION LEARNING: 
SomE DATA 


According to the sequence of five 
hypotheses on the role of nonreward 
and partial reinforcement in discrimi- 
nation, it should be possible to show 
that the FE, measured in Runway 2 
of a double runway, bears certain rela- 
tionships to various stages of discrimi- 
nation learning. More specifically, 
(a) evidence of nonreward related 
frustration, measured in Runway 2, 
should precede any evidence that the 
subjects are running faster to S+ and 
slower to S— in Runway 1, and (b) at 
some point following the appearance of 
discriminative behavior in Runway 1, 
a reduction in the magnitude of the FE 
measured in Runway 2 should become 
apparent. 

In an unpublished doctoral disserta- 
tion, Ward (1961) employed the 
double runway to record discrimina- 
tion to positive (eg., black) and 
negative (e.g., white) cues presented 
singly in Runway 1, and the concur- 
rent development of frustrative effects 
in Runway 2. A single runway condi- 
tion (Runway 1 alone) was used to 
study the same discrimination uncon- 
taminated by Runway 2 performance. 
There were 20 subjects in the double 
runway group and 10 subjects in the 
single runway comparison § group. 
They were run six trials a day for 30 
days, and the intertrial interval was 
about 10 minutes. Figure 1 presents 
data for the 20 subjects in the double 
runway. The top panel shows the 
developing FE in Runway 2. The 
bottom panel shows discrimination 
starting at about Day 7 (42 trials), 
and at a point well after the beginning 
of FE development on Day 3 (18 
trials). There is an indication that the 
FE almost disappears (perhaps only 
temporarily) at a point where separa- 
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Fic. 1. The appearance and diminution 
of the frustration effect in Runway 2 (top 
panel) in sequential relationship to the de- 
velopment of the successive discrimination 
in Runway 1 (bottom panel). (From Ward, 


1961) 


tion of the discrimination curves is 
greatest. This occurs at about Day 15 
(90 trials). 

It could be argued the Ward result 
is an artifact: that different subjects 
reach criterion at different times. An- 
ticipating this, I selected the data of 
seven of Ward's subjects which 
reached the discrimination criterion on 
the same block of trials. Figure 2 
presents these data in 2-day blocks 
and shows essentially the same rela- 
tionships as in Figure 1. 

Figure 3 presents data from the 
single runway comparison group and 
shows the Runway 1 discrimination 
unaffected by any subsequent Runway 
2 performance. Even in the absence 
of the second runway there is a tend- 
ency for the N and R curves of the 
discrimination to converge on later 
trials (see 3erry, Verplanck, & 


Graham, 1943). These data of Ward, 
which have been duplicated in our 
laboratory as by-products of other 
studies, are taken as evidence for the 
involvement of frustrative effects in 
discrimination in a manner such as our 
analysis suggests. 

In another experiment using the 
same double runway apparatus, Ward 
(1961) studied rate of discrimination 
learning in Runway 1 as related to 
number of prediscrimination rewards. 
In this experiment, which was similar 
to an earlier one of Shoemaker (1953) 
performed in a single runway, the re- 
sponse of running in Runway | and 
entering G, was rewarded 0, 12, or 
48 times prior to learning a_black- 
white discrimination in Runway 1. 


An implication of our position is that, 
with initial positive tendencies to the 
two discriminanda equal, rate of dis- 
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Fic. 2. These are the relationships shown 
in Figure 1 for seven subjects selected be- 
cause they reached the criterion of discrimi- 
nation on the same block of trials. (Adapted 
from Ward, 1961) 
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Fic. 3. The successive discrimination in 
a group which ran only in Runway 1. (Ex 
treme N times going up to 3 or 4 seconds 
are not shown. Note the decline of the non- 
reward curve in the single runway group as 
in the double runway groups.) 


crimination learning should be a posi- 
tive function of the strength of rpg to 
the negative stimulus. This meant 
that prediscrimination trials were all 
rewarded in G, and that Runway 1 was 
black on half of these trials and white 
on the other half. On discrimination 
trials, Runway 1 was black when G, 
contained reward and white when G, 
contained no reward (or vice versa). 
Second runway performance was meas- 
ured on every discrimination trial fol- 
lowing reward or nonreward in G, to 
get a measure of the FE, 
strength at different stages could then 
be related to the developing discrimina- 
tive performance. Table 1 shows the 
results of this experiment both 
running time and goal entry measures 
A more 


whose 


for 


for two criteria of learning. 
detailed analysis of the data in Ward’s 
dissertation also indicates the appear- 
ance of the FE before discrimination 
and its diminution or disappearance 
following discrimination. These data 
are in accord with, and go a little 
beyond, previously cited studies which 
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stress the importance of nonreward in 
discrimination. 

A recent experiment by Jenkins 
(196la) demonstrates the importance 
of nonreward effects in discrimination 
learning in another way. Using 
pigeons as subjects, a discrete trial 
procedure, and key pecking as the cri- 
terion response, Jenkins demonstrated 
that the positive stimulus elicits twice 
as many responses to extinction follow- 
ing discrimination training as does the 
same stimulus following an equal num- 
ber of continuous reinforcements. On 
the basis of an ingenious arrangement 
of controls, Jenkins is able to conclude 
that the increased resistance to extinc- 
tion is not due to the fact that the 
positive stimulus is eventually dis- 
criminated from the negative, but 
rather to the occurrence of nonrein- 
forced the 
learning the discrimination. 


responses in process of 


FRUSTRATIVE NONREWARD THEORY 
AND PREDISCRIMINATION EFFECTS 


In the Ward experiment, subjects 
were exposed before a discrimination 


TABLE 1 


MEAN Day ON WHICH CRITERION OF 
CRIMINATION REACHED BY GROUPS 
FOLLOWING 0, 12, oR 48 PRE- 
DISCRIMINATION B+ W+ 
TRIALS 


Dis- 


Group 
Condition 
Running time 


Easy criterion 8.0 6.5 
Hard criterion | 12.8 | 10.3 


Goal entry time 


Easy criterion 7.2 4.3 
Hard criterion | 10.9 | 8.1 


*p < .05. 
*> < 01. 
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to both of the stimuli, and approach 
responses in the presence of both were 
always rewarded. The Ward experi- 
ment represents continuous (predis- 
crimination) reward experience with 
both discriminanda. Now let us extend 
frustrative nonreward theory to a class 
of prediscrimination treatments broader 
than that represented by the Ward 
experiment. 

Consider how discrimination learn- 
ing will be affected not only by 
continuous prediscrimination reward 
experience but also by partial pre- 
discrimination reward experience in 
relation to one or to both of a pair of 
discriminanda. I will restrict the dis- 
cussion, in this paper, to a black-white, 
successive discrimination (single pres- 
entation of stimuli). The problem is 
to make predictions about the course 
of such a discrimination on the basis 
of some history of consistent or of 
partial reward experience in relation to 
one or both of the eventual discrimi- 
nanda. Discrimination is measured 
in terms of amplitude (running time) 
changes to the positive and negative 
discriminanda. 

Table 2 lists several possible kinds 
of exposure to discriminanda prior to 
a black-white discrimination. As we 
examine these cases, it is clear that 
theorists of learning have been inter 


TABLE 2 


KINDS OF PREDISCRIMINATION EXPERIENCE 
IN RELATION TO A BLACK (+) 
Waite (—) DISCRIMINATION 


One Prediscrimination Stimulus 


B+ Continuous Reward, Positive Stimulus 

W-+ Continuous Reward, Negative 
Stimulus 

B+ Partial Reward, Positive Stimulus 

W+ Partial Reward, Negative Stimulus 


Two prediscrimination stimuli 


B+ W-+ Continuous Reward, Both Stimuli 
B+ W+ Partial Reward, Both Stimuli 
B— W-+ Discrimination Reversal 
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ested mainly in prediscrimination ex- 
perience involving both discriminanda. 
An early example is the continuity 
versus noncontinuity experiments 
which varied amount of presolution 
experience with the discrimination 
problem itself (B+W-—) or with its 
reverse (B—W+), and with receptor 
orienting activity in relation to the 
discriminanda (eg., Ehrenfreund, 
1948; Gatling, 1951; Krechevsky, 
1932, 1938; Lashley, 1942; Spence, 
1940, 1945). More recent work on 
prediscrimination treatments as_ they 
influence later discrimination learning 
has taken several forms including 
Harlow’s (e.g., 1949, 1950, 1957) ex- 
periments on learning sets, Lawrence’s 
(1949, 1950, 1952) studies on acquired 
distinctiveness of cues, and the work 
relating overlearning to discrimination 
reversal and transfer (e.g., Birch, 
Ison, & Sperling, 1960; Capaldi & 
Stevenson, 1957; Pubols, 1956; Reid, 
1953). These newer interests in pre- 
discrimination experience have been 
somewhat less oriented than the earlier 
ones toward theoretical argument. In 
fact, they have to some extent tended 
to bring together the earlier wide split 
between S-R and cognitive approaches 
to discrimination, while investigating 
mediational affecting dis- 
crimination. Perhaps a point has been 
reached where it may be profitable to 
begin looking at the prediscrimination- 
discrimination sequence in a way 
which will bring the cognitive-expect- 
ancy position even more into the fold 
of an S-R_ (conditioning-expectancy ) 
interpretation. It seems to me that a 
treatment of discrimination phenomena 
in terms of anticipatory (conditioned) 
mechanisms does just this. 

Consider the following “attributes” 
of prediscrimination exposure to a 
single stimulus: (a) The prediscrimi- 
nation stimulus may become either the 
positive or the negative stimulus in the 
eventual discrimination. (b) The re- 


processes 
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sponse to this prediscrimination stimu- 
lus may be partially or continuously 
rewarded. (c) The number of predis- 
crimination exposures to the stimulus- 
response-reward (or nonreward) se- 
quence may be large, small, or of 
some intermediate value. Figure 4 


4The values for number of trials, 10, 32, 
and 120, which appear in Figures 4, 5, and 6, 
and in the text in various places, are speci- 
fied here for purposes of exposition and 
because we are now conducting experiments 
which involve these numbers of prediscrimi- 
nation trials. However, these numbers of 
trials only represent guesses, at the present 
time, as to when the various stages of prac- 
tice are reached. Rather than be pinned 
down to these numbers of trials in the pre- 
dictions for discrimination learning which 
follow, I would prefer to set up independent 
indices of the existence of one stage or 
another and to predict discrimination out- 
come on the basis of these indices rather 
than some number of trials. The index for 
the first stage is relatively safe and simple. 
It is a stage at which nonreward produces 
no evidence of frustration as measured by 
the FE in a double runway, or in any other 
way. If there is no frustration, there can 
be no rr. The presence of rr takes us into 
the second stage. The choice of a number 
of trials to designate the second stage is 
most difficult since it must represent a very 
critical phase in partial reinforcement: 
enough trials for rr to be present but not 
so many that sr is associated with approach. 
The independent indicator of the middle 
stage (rr, but sr not associated with ap- 
proach) is increased variability of behavior 
after nonreward produces the FE 
(Amsel, 1958). Stopping partial reward 
acquisition at such a point defines this middle 
stage. The late stage (identified here, 
conservatively, as 120 trials) has a clear 
independent definition. It exists (a) when 
variability decreases again, but more particu- 
larly (b) when partial reward performance 
exceeds continuous reward performance in 
vigor (see Goodrich, 1959; Haggard, 1959; 
Spence, 1960; Wagner, 196la). While it 
might appear from earlier partial reinforce- 
ment experiments that 32 trials of partial 
reinforcement are sufficient to get us to the 
last stage and produce the PRE (and there- 
fore are too many trials for our intermediate 
condition) experiments demonstrating PRE 
with this many acquisition trails (e.g., Shef- 
field, 1949; Weinstock, 1954) have usually 
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Fic. 4. Schema showing 24 prediscrimi- 
nation-discrimination sequences (12 condi- 
tions, each counterbalanced) resulting from 
three attributes of prediscrimination ex- 
posure to a single discriminandum. 


shows the various possible combina- 
tions of a, b, and ¢ for a black-white 
discrimination. The 12 cells of this 
schema represent a matrix of possible 
experimental conditions, each counter- 
balanced for absolute effects of color. 

What can be predicted from a be- 
havior theory with frustrative non- 
reward premises about the effects of 
prediscrimination experiences in rela- 
tion to a single discriminandum on 
subsequent black-white discrimination ? 
Is it possible to show that all combina- 
tions of a, b, and ¢ and also effects of 
prediscrimination experience involving 
both discriminanda will be deducible 
from a relatively small number of prin- 
ciples? In the next four sections, I 
will look at four kinds of prediscrimi- 
nation treatment and, for each one, try 
to see what the theory leads us to 
expect. 


Partial Reward of a Response to One 
Discriminandum Prior to Discrimina- 
tion 


Our guiding conception, when the 
prediscrimination treatment involves 





involved an initial period of about 10 con- 


tinuous rewards. Our procedure involves 
partial reward from the outset. 
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partial reinforcement and a sufficiently 
large number of trials, is that antici- 
patory frustration stimull 
(sp) become associated with approach 
responses. (Miller, 1960, has sug- 
gested a similar mechanism—‘learning 
to advance to the cue of fear”- 
count for the learning of resistance to 
fear.) The reasoning is that while sp 
will at first elicit responses antagonistic 


produced 


-to ac- 


to approach, it will finally become con- 
nected to approach tendencies. This 
means that at some stage in partial 
reinforcement sy will become part of 


a stimulus complex evoking approach, 


and earlier evidence of conflict will 
disappear. 

Of the four stages of practice in 
partial reinforcement previously out- 
lined, three are theoretically differenti 
able with respect to the involvement of 
rr, and therefore sp: a stage where rp 
has been conditioned, but not yet rp 
(10 trials) ; a stage where both rp and 
ry have conditioned and _ their 
stimuli, sz and sy, still evoke competing 
(approach and avoidance) response 
tendencies (32 trials); and a 
where both rp and ry are conditioned 


been 


stage 


and the stimuli they produce, sg and 
sr, both evoke approach tendencies 
(120 trials). This conceptualization 
of three stages in partial reinforcement 
in regard to rp—sp, along with an 
assumption borrowed from Miller's 
analysis of conflict and displacement 
(1944, 1948), will provide the basis 
for deductions which I find 
interesting. 

The assumption from Miller’s analy- 
sis of conflict, for which there is 
experimental support, is that the gen- 
eralization gradient for positive (ap- 
proach) tendencies is flatter than that 


some 


Lor vd 


for negative (avoidance) tendencies. 
If we now identify sge—>App (which 
results from rg) and sp—Av (which 
results from rp) as such positive and 
negative tendencies, the three panels 
of Figure 5, from top to bottom re- 
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STRENGTH OF RESPONSE TENDENCY AT 
BEGINNING OF DISCRIMINATION FOLLOWING 
PREDISCRIMINATION PARTIAL REWARD 
TO BLACK (B*) 





EARLY STAGE 
(10 TRIALS) 





MIDDLE STAGE 
(32 TRIALS) 


o— Rae APP Racy 
- 
*s, 
~~." 4¥0%¢ 
~ ~“4ne, 





LATE STAGE 
(120 TRIALS ) 











BLACK WHITE 


RELATIVE STRENGTH OF RESPONSE TENDENCIES 


DISCRIMINATION STIMULUS 
Fic. 5. An application of an assumption 
from Miller’s analysis of conflict to predic 
tions of ease of discrimination learning fol- 
lowing various prediscrimination exposures 
to a single discriminandum. 


spectively, represent the state of af- 
fairs at the beginning of a discrimina- 
tion, when the discrimination has been 
preceded by a few, an intermediate 
number, or a large number of partial 
reinforcements of a response to the 
prediscrimination stimulus (B). Rep- 
resented on the left-hand side of the 
graph are the relative strengths of ap 
proach and avoidance tendencies to 
the stimulus (B) 
through the mediating response pro- 
duced stimuli, sp and sp. Shown on 
the right are the generalized strengths 
of these tendencies to the stimulus 
(W) which will be new in the dis- 
crimination. (If the prediscrimination 
stimulus were W, then B would.be the 
new stimulus in the discrimination and 
the B and W positions on the baseline 
would of course be reversed. ) 

A simplifying assumption, repre- 
sented in the bottom panel of Figure 5, 
is that the strength of approach, elicited 
through rp—sp by B and W in the late 


prediscrimination 
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stage, is related to the strength of 
avoidance tendencies elicited through 
ry—sp in the middle stage. I am sug- 
gesting that the avoidance tendency to 
Sp is “neutralized” and has the effect 
of an approach tendency at this stage. 

Some predictions can now be made 
relative to the conditions in Figure 4, 
and these will be restricted to the 
underlined conditions. (The other 
conditions simply counterbalance for 
any absolute effects of color and [ as- 
sume they will not be different in 
direction.) Again, let me emphasize 
that the predictions will differ depend- 
ing on the depth dimension: Was 
prediscrimination training discontinued 
at the stage of development of rp, but 
not rr, indicated by the point of earliest 
appearance of FE? Or, was training 
carried on to the intermediate conflict 
stage at which sp evokes avoidance? 
Or, were a very large number of trials 


run, allowing sp to become connected 
to approach? Outlined below, for each 


of these cases, are predictions as to 
which discrimination will be learned 
faster: that in which the prediscrimi- 
nation stimulus remains positive in the 
(B+-—B+W-), or 
negative 


discrimination 
that in which it 
(B+—B-W+). 

Let us begin with the case of dis- 
crimination following 120 partial re- 
wards. The Figure 
5): At the start of discrimination, B 
will elicit rz and rp strongly and about 
equally. On the other hand, W, which 
is new in the discrimination, will elicit 
rr relatively strongly because of the 
flat positive gradient, but will elicit rp 
very weakly because of the steep nega- 
tive gradient. This means that when 
B is made negative in the discrimini- 
nation after a great many prediscrimi- 
nation trials, B— will elicit strong 
rpy—Sp, and sp will elicit continued ap- 
proach. This will slow discrimination 
relative to the condition where B is 
made positive in the discrimination. 


becomes 


reasoning (see 
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Here the W— stimulus which is new 
in the discrimination elicits very weak 
ry. Consequently, there is little sp 
to elicit approach, little to counteract 
the build-up of avoidance to W-—, and 
discrimination should develop more 
quickly. The theory predicts the fol- 
lowing: After a very large number of 
prediscrimination partial rewards of 
an approach response to B, B+W-— 
will produce faster discrimination than 
will B—W-+. This prediction and its 
relationship to other predictions are 
shown in Figure 6. 

The most interesting and critical 
comparison to be made for purposes 
of our analysis is between the 120- 
prediscrimination-trial case and the 
intermediate, 32-prediscrimination-trial 
case. In the 120-trial case sp is al- 
ready connected to approach, while in 
the 32-trial case sp still elicits avoid- 
Consequently the predictions in 
two reversed. The 


ance. 


these cases are 
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RELATIVE RATE OF DISCRIMINATION LEARNING 











120 
NUMBER OF PRE-DISCRIMINATION TRIALS 


Fic. 6. Predicted rate of discrimination 
learning following various prediscrimination 
or both of the eventual 


exposures to one 


discriminanda. 
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reasoning for the intermediate case is 
that both rg and rp are being evoked 
by B at the start of discrimination, but 
Sp elicits approach and sp elicits avoid- 
ance. When, after 32 trials, we switch 
from prediscrimination B+ to the 
B+W-— discrimination, B+ evokes 
both approach and avoidance tenden- 
cies through sg and spy, and W-—, new 
in the discrimination, evokes mainly 
generalized approach (rp) and little 
avoidance (rg). When the shift is 
from B+ to B—W-4+, there is a strong 
tendency for sp to elicit avoidance in 
relation to the now negative B— (gen- 
eralizing very little to W+) and a 
strong tendency for sp to elicit ap- 
proach to the new, now positive, W+. 
This latter situation is much more 
favorable for discrimination. The pre- 
diction: After an intermediate number 
of partial rewards of an approach 
response to B, B+W-— will pro- 
duce slower discrimination than will 
B-W-+. 

After a few B= trials, enough to 
produce some r, but little or no FE 
and no ry, there should be no differ- 
ence in the rate of the B+W-— and 
B—W+ discriminations. 


Continuous Reward of a Response to 


One Discriminandum Prior to Dis- 


crimination 


We deal here with the underlined 
conditions on the right of Figure 4: 
continuous (B+) _prediscrimination 
experience followed by a discrimina- 
tion in which B remains positive 
(B+W-—) or _ becomes negative 
(B—W+). The expectations here 
are most interesting when we compare 
them to the predictions just made 
for prediscrimination partial reward. 
sriefly, with increasing numbers of 
rewards of approach responses to the 
prediscrimination stimulus, B+ should 
evoke increasing rg, and nonreward in 
the subsequent discrimination should be 
increasingly frustrating. Since rz gen- 
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eralizes strongly from B to W, both B 
and W should evoke rpg quite strongly 
and almost equally in the discrimina- 
tion. Nevertheless, the FE should be 
somewhat greater when B is made the 
negative stimulus in the discrimination 
than when it stays positive. However, 
in either case (B+—B+W-— or B+ 
—B—W+), the flat gradient of gen- 
eralization of positive tendencies leads 
to a prediction which is the same as 
the earlier Ward result: increasing the 
number of continuous prediscrimina- 
tion rewards should produce faster dis- 
crimination. 

In contrast to the center panel of 
Figure 6 in which the relationships at 
the middle and end points are reversed, 
the differences between the two curves 
in the top panel are shown as consist- 
ently in favor of faster discrimination 
for B+—B+W-. This is not a pre- 
diction based on frustrative nonreward 
factors alone. In fact, part of this is 
not a prediction at all. There is evi- 
dence in some unpublished data® that 
a switch from B+ in prediscrimina- 
tion to B— in discrimination does pro- 
duce a greater FE in Runway 2 than a 
switch from B+ to W-, as expected 
(Amsel & Hancock, 1957); however, 
an immediate generalization decrement 
of the Runway 1 response in switching 
from B to W weakens response to W, 
which speeds the apparent separation 
of the B+W-— discrimination curves 
and retards the separation of the 
B-—W-+ curves. The suggestion is 
that, in the case of prediscrimination 
continuous reinforcement, the factor of 
generalization decrement, operating 
against the somewhat greater FE when 
the prediscrimination stimulus _ be- 
comes negative in the discrimination, 
should maintain the relative advantage 
of B+—B+W-— over B+—B-—-W+ 


5 The reference is to data collected by 
Susan Hicks at Newcomb College, Tulane 
University, as part of an undergraduate 
honors project. 
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for all numbers of prediscrimination 
trials. In prediscrimination partial re- 
inforcement (middle panel of Figure 
6), since there is a switch-over in rela- 
tive positions from 32 to 120 trials, the 
factor generalization decrement 
should reduce the apparent difference 
at the intermediate point and increase 
the reversed difference at the 120-trial 


of 


point. 

While some of the implications of 
the analysis of variables represented 
by the schema in Figure 4 are cur- 
rently being tested using a runway 
apparatus with amplitude changes as 
the response measures, we plan also to 
test these ideas in terms of rate of dis- 
crimination in situations 
with single presentation of discrimi- 
In these latter situations, gen- 
eralization decrement effects due to 
introduction of new stimuli would still 
be expected to effect running speed; 
however, they might have less effect on 
a correct-choice indicant of speed of 
discrimination, and provide an addi- 
tional check on the implications of this 


two-choice 


nanda. 


position. 

We have been looking at situations 
in which the to a_ single 
stimulus cue is partially or continu- 
ly rewarded some number of times, 


"1 
OUD! 


response 


and this stimulus then becomes either 
the positive or the negative cue in a 
discrimination. The last two sections 
of this paper will outline some of the 
factors operating in prediscrimination 
experience involving both discrimi- 
nanda, 

Partial versus Continuous Reward of 
Responses to Both Discriminanda Pre- 
sented Separately Prior to Discrimi- 
nation 


The bottom panel of Figure 6 sum- 


marizes predictions about the rate of 
discrimination learning following pre- 


discrimination experience with both 
discriminanda presented separately. I 
am suggesting that B+W+—-B+W-— 
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operates essentially like B+—-B+W-— 
(or B+—-B—W+), except that it 
would not be susceptible to generaliza- 
tion effects, since neither B nor W is 
new in the discrimination. There is 
some evidence related to this condition 
from the Shoemaker and Ward experi- 
ments, but only for the part of the 
“function” covering the first two 
points. Analysis of B+W= as a pre- 
discrimination condition indicates a 
decreasing monotonic relationship be- 
tween rate of discrimination and num- 
ber of such prediscrimination trials. 
The reasoning is simply that partial 
reinforcement of approach responses to 
both stimuli will result in conflicting 
approach and avoidance response tend- 
encies to both B and W at the 32-trial 
stage; and the elicitation of approach 
responses to sp in connection with both 
B and W at the 120-trial stage. 
While the former represents a dif- 
ficult base on which to build discrimi- 
nation, as compared with B+W+-, the 
latter should be even worse for dis- 
crimination, since it involves the 
mechanism for the PRE (sp connected 
to approach) elicited by the negative 
stimulus in the discrimination. Conse- 
quently, a prolonged period of con- 
tinued approach to both B+ and W— 
would be expected in the discrimina- 
tion phase. 


Discrimination Reversal as a “Pre- 


discrimination” Phenomenon 


The evidence on discrimination re- 
versal is certainly not unequivocal ; 
however, the weight of it suggests a 
U shaped relationship between ease of 
discrimination learning (B+W-—) and 
number of prior B—W++ trials. This 
would look like the _B+—~B+W-— 
prediction (middle panel Figure 6) 
in direction. Reversal, at least up 
to the 32-trial point, would produce 
successively slower learning; but re- 
versal after many postcriterion trials, 
e.g., the 120-trial point, would facili- 
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tate the formation of the reversed 
discrimination. 

The discrimination reversal studies 
cited earlier have been conducted in 
situations involving joint presentation 
of stimuli. They involve two basic 
findings. The first concerns ease of 
discrimination following the first re- 
versal; the second concerns the “learn- 
ing to learn” phenomenon—that suc- 
cessive reversals shorten the numbers 
of trials required to criterion. For 
the first of these findings, there is scant 
evidence in situations involving single 
presentation of stimuli (the 
case) ; however the recent Birch, Ison, 
and Sperling study (1960) is impor- 
tant and relevant. 

The conditions of the Birch, 
and Sperling experiment are closest to 
meeting the boundary conditions of a 
frustrative nonreward analysis of re- 
versal “prediscrimination” phe- 
nomenon. Their subjects, rats, were 
given 11 trials each day, the first al- 


present 


Ison, 


as a 


ways positive, the next 10 divided be- 


tween positive and negative according 
to random orders. Their measure was 
response speed rather than choice, and 
their apparatus was essentially a run- 
way. They were trained initially to 
approach a white platform and to avoid 
a black one and, when they reached a 
criterion of discrimination defined in 
terms of approach speed, half of the 
3+W- 
discrimination immediately, the other 
half after 12 additional days of over- 
training to B—W+. The 
number of trials to reach the original 
criterion for both groups was about 50 
(well beyond our 32-trial point). This 
means that the overtrained group had 
about 180 trials to B—W+ (well 
beyond our 120-trial point) before it 
and B+W-. 
The overtrained group learned the re- 
versed discrimination in significantly 
fewer trials. 


subjects were switched to a 


average 


was reversed learned 
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Interpreting their results, birch, 
et al. (1960) state: the differ- 
ence between the two groups appears 
to be located in their differential rates 
of extinction to the previously rein- 
forced cue. No difference in rate of 
acquisition to the newly positive cue 
was found in the present data” (p. 39). 
results have a rather 
interpretation in terms of a frustrative 
nonreward position; but the situation 
is complex and there are theoretical 
difficulties to overcome. The obvious 
part of the interpretation is that the 
overlearned group 
stronger rp to W+, which makes 
subsequent W-— more frustrating, 
which facilitates the conditioning of W 
to rp, which, finally, ensures the evoca 
tion of avoidance responses 
This part is fine for the theory; how- 
ever, there is the matter of what hap- 
pens to sp, which presumably develops 
in the initial discrimination of the 
Birch, et al. experiment to the negative 
stimulus (B—) and which is connected 
to avoidance. Shouldn't the over- 
trained group have a stronger sp 
avoidance tendency in to B 
than the criterion group? And 
shouldn’t this retard the learning of 
the reversed discrimination in the 
overtrained group when B_ subse- 
quently becomes the positive stimulus ? 
This difficulty remains for the theory 
unless one assumes that ry (and sp 
approach) builds in strength very 
slowly over many trials (at least over 
50) while rp (and sp—avoidance) de- 
velops quickly and is maximal by about 
50 trials—this being the case despite 
the fact that the growth of rp depends 
This is 
certainly an interesting possibility. 


These obvious 


has developed 


by sp. 


——> 


relation 


upon the prior growth of rr. 


SUMMARY 
This paper begins with an account 
of recent evidence for a concept of 
frustrative nonreward in behavior 
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theory, and presents support for the 
position that an “active” role must be 
assigned to nonreward in both motiva- 
tion and inhibition. This evidence is 
not only from experiments specifically 
designed to study frustrative proper- 
ties of nonreward, but from a 
variety of other kinds of experiments, 
including, among others, studies of 
extinction as a function of magnitude 
of reward and number of rewards, and 
experiments in discrimination learning 
which demonstrate the relative impor- 
tance of nonrewarded rewarded 
trials. 

The importance of an active concep- 
tion of nonreward in current thinking 
with 
larities and differences between recent 
cognitive-expectancy and neo-Hullian 
interpretations of the partial reinforce- 
ment effect. The suggestion is that 
current S-R conditioning interpreta- 
tions of these partial reinforcement 
phenomena, stressing the role of both 
anticipatory reward and anticipatory 
frustration mechanisms, and involving 
a more specific and better worked out 
language, provide a more workable 
“expectancy” theory than do cognitive 
interpretations. 

The contention is that current neo 
Hullian approaches, employing condi- 
tioning-expectancy concepts, go beyond 
cognitive-expectancy approaches in 
both specificity and predictive power. 
As evidence for the merit of such a 
position, data are presented from dis- 
crimination learning experiments which 
test predictions from a conditioning 
analysis of nonreward effects, and an 
extension of frustrative nonreward 
theory is presented which leads to pre- 
dictions about the course of discrimi- 
nation learning following various pre- 
discrimination 
predictions depend on the interaction 
of three attributes of prediscrimination 
experience: (a) whether prediscrimi- 


also 


and 


is discussed reference to simi- 


experiences. These 
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nation exposure is to the stimulus 
which is eventually positive or negative 
in the discrimination (or to both) ; (b) 
whether the prediscrimination treat- 
ment involves continuous or partial 
reward; and (c) the number of 
prediscrimination exposures (trials), 
representing different stages of de- 
velopment of anticipatory reward and 
anticipatory frustration. 


REFERENCES 
ApeLMAN, H. M., & Maarscu, J. L. Re 


sistance to extinction as a function of the 
type of response elicited by frustration. 
J. exp. Psychol., 1955, 50, 61-65. 

ApvELMAN, H. M., & Rosenraum, G. 
tinction of instrumental behavior as a 
function of frustration at various distances 
from the goal. J. exp. Psychol., 1954, 47, 
429-432. 

AmseL, A. A three-factor theory of inhibi- 
tion: An addition to Hull’s two-factor 
theory. Paper read at Southern Society 
for Philosophy and Psychology, Roanoke, 
1951. 

Amset, A. The role of frustrative nonre- 
ward in noncontinuous reward situations 
Psychol. Bull., 1958, 55, 102-119. 

AmseL, A., ErnuArt, C. B., & GALBRECHT, 
C. R. Magnitude of frustration effect and 
strength of antedating goal factors 
Psychol. Rep., 1961, 8, 183-186. 

AmseL, A., & Hancock, W. Motivational 
properties of frustration: III. Relation of 
frustration effect to antedating goal fac 
tors. J. exp. Psychol., 1957, 53, 126-131. 

AmseL, A., & Penick, E. The influence of 
early experience on the frustration effect. 
J. exp. Psychol., 1962, 63, 167-176. 

AmseEL, A., & Prouty, D. L. Frustrative 
factors in selective learning with reward 
and nonreward as discriminanda. J. exp. 
Psychol., 1959, 57, 224-230. 

Amset, A., & Rousset, J. Motivational 
properties of frustration: I. Effect on a 
running response of the addition of frus 
tration to the motivational complex 
J. exp. Psychol., 1952, 43, 363-368. 

AmseL, A., & Warp, J. S. Motivational 
properties of frustration: II. Frustration 
drive stimulus and frustration reduction 
in selective learning. J. exp. Psychol. 
1954, 48, 37-47. 

Armus, H. L. Effect of magnitude of re 
inforcement on acquisition and extinction 


Ex- 





326 ABRAM 


of a running response. J. exp. Psychol., 
1959, 58, 61-63. 

Berry, R. N., 
GRAHAM, C. H 
nation in a simple running habit. J 
Psychol., 1943, 32, 325-334. 

Bircw, D. Discrimination 
function of the ratio of 
reinforced trials. J. comp 
chol., 1955, 66, 456-464. 

Bircu, D., Ison, J. R., & Spertine, S. E. 
Reversal learning under single stimulus 
presentation. J. exp. Psychol., 1960, 60, 
36-40. 

BItTERMAN, M. E., Fepperson, W. E., & 
Tyzer, D. W. Secondary reinforcement 
and the discrimination hypothesis. mer. 
J. Psychol., 1953, 66, 456-464. 

Bower, G. H. A contrast effect 
ential conditioning. J. erp. Ps 
62, 196-199. 

Bower, G. H., & Stocks, D. Graded fru 
tration effect as a function of graded re- 
duction in reward. Paper read at Psycho 
nomic Society, Chicago, 1960 

Brown, J. S., & Farner, I. E. 
conceptualized as intervening 
with suggestions toward a theory of fru 
tration. Psychol. Bull., 1951, 48, 465-495. 

Busu, R. R., & Mosterter, F \ 
matical model for simple learning 
chol. Rev., 1951, 58, 313-323 

Cantor, G. N., & Spiker, C. C. Effects of 
nonreinforced trials on discrimination learn 
ing in preschool children. J. exp. Psy- 
chol., 1954, 47, 256-258 

Capatpi, E. J., & Stevenson, H. W 
sponse reversal following different amounts 


Psych rf. 


VerpLAnck, W. S&S. & 
Che reversal of discrimi- 


exp 


learning as a 
nonreinforced to 
phystol Psy- 


in differ- 
hol., 1961, 


Emotions 
variables 


mathe- 


Psy- 


Re- 
of training J. comp. physiol 
1957, 50, 195-198 

D’Amato, M. R., & Jacopa, H. Effects of 
extinction trials on discrimination reversal. 
J. exp. Psychol., 1960, 59, 254-260 

Denny, M. R., & Dunnam, M. D 
effect of differential 
the incorrect response on the learning of 
the correct response in the simple T-maze. 
J. exp. Psychol., 1951, 41, 382-389. 
SHRENFREUND, D 


The 


nonreinforcement of 


An experimental test of 
the continuity theory of discrimination 
learning with pattern vision. J. 
physiol. Psychol., 1948, 41, 408-422 

<NINGER, M. U. The role of generalized ap- 
proach and avoidance tendencies in bright- 
ness discrimination J. comp. physiol 
Psychol., 1953, 46, 398-402 

Estes, W. K 
of learning. 


107. 


comp 


Toward a statistical theory 


Psychol. Rev., 1950, 57, 94- 


AMSEL 


Estes, W. K. The statistical approach to 
learning theory. In S. Koch (Ed.), 
Psychology: A study of science. Vol. 2. 
New York: McGraw-Hill, 1959. 

Ferster, C. B. Withdrawal of positive re- 
inforcement as punishment. Science, 1957, 
126, 509. 

Ferster, C. B. Control of 
chimpanzees and pigeons by time out from 
positive reinforcement. Psychol. Monogr., 
1958, 72(8, Whole No. 461). 

FestTincer, L. A theory of c 
sonance. Evanston, LIL: 
1957. 

FESTINGER, L. 
insufficient rewards 
1961, 16, 1-11. 

Firzwater, M. E 


behavior in 


rgnitive dis 

Row, Peterson, 

The psychological effects of 

lmer. Psychologist 

The relative effect of 
reinforcement and nonreinforcement in 
establishing a form discrimination. J. 
comp. physiol. Psychol., 1952, 45, 476-481 

GatiinGc, F. A study of the continuity of 
the learning process as measured by habit 
reversal in the rat. J. comp. physiol. 
Psychol., 1951, 44, 78-83 

Goopricu, K. P. Performance in different 
segments of an instrumental response 
chain as a function of reinforcement sched 
ule. J. exp. Psychol., 1959, 57, 57-63 

Grice, G. R., & GotpMan, H. M. General 
ized extinction and secondary reinforce- 
ment in visual discrimination learning 
with delayed reward. J. exp. Psychol., 
1955, 50, 197-200. 

Grove, G. R., & EN1NGER, M. | The rela 
tive importance of approach and avoidance 
tendencies in brightness discrimination 
learning. Paper read at Midwestern Psy- 
chological Association, Chicago, 1952. 

Haccarp, D. F. Acquisition of a 
running response as a function of partial 
and continuous schedules of reinforcement. 
Psychol. Rec., 1959, 9. 11 18. 

Haritow, H. F. The formation of learning 
sets. Psychol. Rev., 1949, 56, 51-65. 

Hartow, H. F. Analysis of discrimination 
learning by monkeys. J. exp. Psychol., 
1950, 40, 26-39 

Harvow, H. F., & Hicks, L. H. Discrimi- 
nation learning theory: Uniprocess vs. 
duoprocess. Psychol. Rev., 1957, 64, 104- 
109. 

Hortper, W. B., Marx, M. H., Hovper, E. 
E., & Cottier, G. Response strength as 
a function of delay of reward in a runway 
J. exp. Psychol., 1957, 53, 316-323. 

Hutt, C. L. Principles of behavior. 
York: Appleton-Century, 1943. 

Hutt, C. L. A behavior system. 
Haven: Yale Univer. Press, 1952. 


simple 


New 





FRUSTRATIVE NONREWARD 


Hus, «. L. 
O., & Barker, A. N. 
esophageal feeding as 
J. comp. physiol. Psychol., 
245. 

Hutse, S. H., Jr. Amount and percentage 
of reinforcement and duration of goal 
confinement in conditioning and extinc 
tion. J. exp. Psychol., 1958, 56, 48-57. 

Humpureys, L. G. The effect of random 
alternation of reinforcements on the ac 
quisition and extinction of conditioned eye 
lid reactions. J. exp. Psychol., 1939, 285, 
141-158. 

Humpnreys, L. G. condi 
tioned psycho-galvanic response following 
two conditions of reinforcement. J. exp 
Psychol., 1940, 27, 71-76 

Jenkins, H. M. The effect of discrimina 
tion training on extinction. J. exp. Psy 
chol., 1961, 61, 111-121. (a) 

Jenkins, H. M Resistance to extinction 
when partial reinforcement is followed by 
various amounts of regular reinforcement. 
Paper read at Psychonomic Society, New 
York, 1961. (b) 
-ENDLER, H. H., Piiskorr, 
M. R., & Karz, S. 


versus reinforcements as 


Rouse, R. 
True, sham, and 
reinforcements 


1951, 44, 236 


Livincston, J. R., 


Extinction of 


S. S., D’AMArTo, 
Nonreinforcements 
variables in the 
partial reinforcement effect. J. exp. Psy- 
chol., 1957, 53, 269-276. 

CRECUEVSKY, I. 
Psychol. Rev., 1932, 39, 516-532 
.RECHEVSKY, I. A study of the continuity 
of the problem-solving process. Psychol 
Rev., 1938, 45, 107-133 

ACHMAN, R. The influence of thirst and 
schedules of reinforcement-nonreinforce 
ment ratios upon brightness discrimination. 
J. exp. Psychol., 1961, 62, 80-87. 

AMBERT, W. W., & Sotomon, R. L.  Ex- 
tinction of a running response as a func 
tion of distance of block point from the 
goal. J. comp. physiol. Psychol., 1952, 45, 


269-279. 


“Hypotheses” in rat 


ASHLEY, K. S. An examination of the 
“continuity theory” as applied to discrimi- 
nation learning. J. gen. Psychol., 1942, 
26, 241-265. 

,AWRENCE, D. H. Acquired distinctiveness 
of cues: I. Transfer between discrimina- 
tions on the basis of familiarity with the 
stimulus. J. exp. Psychol., 1949, 39, 770- 
784. 


AWRENCE, D. H. Acquired distinctiveness 
of cues: II. Selective association in a 
constant stimulus situation. J. erp. Psy 


chol., 1950, 40, 175-188. 


327 


LAwreENcE, D. H. The transfer of a dis- 
crimination along a continuum. J. comp. 
physiol. Psychol., 1952, 45, 511-516. 

LAWRENCE, D. H. Learning. Annu. Rev. 
Psychol., 1958, 9, 157-188. 

Lawson, R., & Marx, M. H. Frustration: 
Theory and experiment. Genet. psychol. 
Monogr., 1958, 57, 393-464. 

LocaN, F. A., Berer, E. M., & 
W. D. Extinction following partial and 
varied reinforcement. J. exp. Psychol., 
1956, 52, 65-70. 

Loneostretu, L. E. The relationship between 
expectations and frustrations in children 
Child Develpm., 1960, 31, 667-671. 

Marzocco, F. M. Frustration effect as a 
function of drive level, habit strength and 
distribution of trials during extinction. 
Unpublished doctoral dissertation, State 
University of Iowa, 1951. 

Mitier, N. E. Experimental studies of con- 
flict. In J. McV. Hunt (Ed.), Personality 
and the behavior disorders. New York: 
Ronald, 1944. 

Mitter, N. E. Theory and experiment re 
lating psychoanalytic displacement to 
stimulus-response generalization. J. ab- 
norm, soc. Psychol., 1948, 43, 155-178. 

Miiter, N. E. Learning resistance to pain 
and fear: Effects of overlearning, ex 
posure, and rewarded exposure in context. 
J. exp. Psychol., 1960, 60, 137-145. 

Miuier, N. E., & Stevenson, S. S. Agi- 
tated behavior of rats during experimental 
extinction and a curve of spontaneous re 
covery. J. comp. Psychol., 1936, 21, 205 
231. 

Mowrer, O. H. 
havior. New York: Wiley, 1960. 

Nissen, H. W. Description of learned re- 
sponses in discrimination behavior. Psy- 
chol. Rev., 1950, 57, 121-131. 

Nortu, A. J., & Strmmet, D. T. 
tion of an instrumental response follow 
ing a large number of reinforcements. 
Psychol. Rep., 1960, 6, 227-234. 

Oxups, J. The influence of practice on the 
strength of secondary approach 
J. exp. Psychol., 1953, 46, 232-236 

Oups, J. The growth and structure of 
motives. Glencoe, Ill.: Free Press, 1956. 

Paviik, W. B. The effects of deprivation 
schedule and reward magnitude on per- 
formance during acquisition and extinc- 
tion. Paper read at Eastern Psychologi- 
cal Association, Philadelphia, April 1961. 

Penney, R. K. The effects of non-rein- 
forcement on response strength as a func- 
tion of number of previous reinforcements 
Canad, J. Psychol., 1960, 14, 204-215. 


KINCAID, 


Learning theory and be 


Extine 


drives. 





328 ABRAM 


Punots, B. H., Jr. The facilitation of visual 

spatial discrimination reversal by 
overlearning. J. comp. physiol. Psychol., 
1956, 49, 243-248 

Rew, L. S. The development of noncontinu- 
ity behavior through continuity learning. 
J. exp. Psychol., 1953, 46, 107-112. 

Reynoips, W. F., Ricuter, M. L., & Car- 
Lock, J. extinction 
function of reward magnitude and number 
of training trials. Paper read at Psycho- 
nomic Society, Chicago, 1960 

Reynoips, W. F., & Srecet, M. H 
reversal effect as a function of magnitude 
of reward. Psychol. Rep., 1961, 8, 135 
141. 

Rourer, J. H 
ing from non-reward. J. « 
1949, 42, 476-485 

RoSENZWEIG, S 
theory. In J. McV. 
ality and the behavior dis 
York: Ronald, 1944 

Rotter, J. B 
psychology. 

1954. 

Rouse.t, J. Frustration func 
tion of repeated non-reinforcements and as 
a function of the consistency of reinforce 
ment prior to introduction of nonrein 
forcement Unpublished master’s 
Tulane University, 1952 


and 


Resistance to as a 


Drive 


A motivational state result- 
mp. Psychol., 
An outline of frustration 
Hunt (Ed.), Person 
New 


rders 


and clinical 


Prentice-Hall 


; 
carning 


York: 


effects as a 


thesis, 


for 


Psy- 


Sewarp, J. 


the 
hol 


P. Experimental evidence 

motivating function of reward 
Bull., 1951, 58, 130-149 

Sewarp, J. P., Peresoom, A. C., But er, 
B., & Jones, R. B. The role of prefeed 
ing in apparent frustration effect. J. exp 
Psychol., 1957, 54, 445-450. 

SHEFFIELD, V. F function 
of partial reinforcement and distribution of 
practice. J. Psychol., 1949, 39, 511 


526. 


Extinction as a 


exp 

SHEFFIELD, V. | Resistance to extinction 
as a function of the distribution of extinc 
tion trials. J. exp 1950, 40, 305 
313 

SHOEMAKER, H. A. The relative efficiency 
of rewarded and non-rewarded training in 
a black-white discrimination problem 
Unpublished doctoral dissertation, Univer 
sity of Colorado, 1953 
The organisms 

Appleton-Century, 1938. 

Sotomon, R. L The externalization of 
hunger and frustration drive. J. comp 
physiol. Psychol., 1956, 49, 145-148. 


Psy hol > 


SKINNER, B. F. hehavior of 


New York: 


AMSEL 


Spence, K. W. The 
tion learning in animals. 
1936, 43, 427-449. 

Spence, K. W. Analysis of the formation 
of visual discrimination habits in the 
chimpanzee. J. comp. Psychol., 1937, 23, 
77-100. 

Spence, K. W. Continuous 
continuous interpretations of 
tion learning. Psychol. Rev., 
271-288. 

Spence, K. W. An experimental test of the 
continuity and non-continuity theories of 
discrimination learning. J. exp. Psychol., 
1945, 35, 253-266 

Spence, K. W. Behavior theory and condi 
tioning. New Haven: Yale Univer. Press, 
1956. 

Spence, K. W. 
learning. Englewood 
Hall, 1960. 

STanLey, W. C. Extinction as a 
of the spacing of extinction trials. J. 
Psychol., 1952, 43, 249-260. 

Tuelos, J. The partial reinforcement effect 
sustained through blocks of continuous 
reinforcement. J. exp. Psychol., 1962, in 
press. 

Tyrer, D. W., Marx, M. H., & Cottier, G 
Frustration stimuli in discrimination. J. 
exp. Psychol., 1959, 58, 295-301. 

Wacner, A. R. The treatment of inhibition 
in the more complex behavioral situation. 
Paper read at American Psychological As 
sociation, 1961. (b) 

Waaner, A. R. The role of reinforcement 
and nonreinforcement in an “apparent 
frustration effect.” J. Psychol., 1959, 
57, 130-136. 

Wacner, A. R. Effects of amount and per- 
centage of reinforcement and number of 
acquisition trials on conditioning and ex 
tinction. J. erp. Psychol., 1961, 62, 234 
242. (a) 

Warp, J. S. An experimental investigation 
of the role of frustrative factors in 
crimination learning. Unpublished 
toral dissertation, Tulane University, 1962. 

WeEtNstTock, S. Resistance to extinction of 
a running response following partial re- 
inforcement under widely spaced trials 
l, comp. physiol. Psychol., 1954, 47, 318 
322. 

Witson, W., Wess, E. J., & Amset, A. 
Two tests of the Sheffield hypothesis con- 
cerning resistance to extinction, partial 
reinforcement, and distribution of practice. 


J. exp. Psychol., 1955, 50, 51-60. 


nature of discrimina 


Psychol. Rev., 


versus non- 
discrimina- 


1940, 47, 


Behavioral theory 
Cliffs : 


and 


Prentice 


function 


exp 


rp 
ex] 


dis- 


doc- 


(Received November 20, 1961) 





Psychological Review 
1962, Vol. 69, No. 4 


i) 32 343 


THE SELECTION OF STRATEGIES 
IN CUE LEARNING! 
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In a cue learning problem (dis- 
crimination learning, concept forma- 
tion, and maze learning) the subject 
chooses one of two or more responses 
The correctness of the 
response depends on some aspect of 


on each trial. 


the situation at the time of response. 

In this it is assumed that 
subjects difficulty with cue 
learning problems to the degree that 
they 
patterns of response) which conflict 
with the strategy intended by the 
experimenter. It often assumed 
that the subject must also associate 
with in the 
However, in most cue 
learning experiments the subject 
instructed or pretrained to make the 
before the 
begins. Further- 
more, the experimental situation con- 
strains the subject to make exactly 
of the available per 

It seems reasonable to assume 


paper 
have 


tend to use strategies (habits, 


1S 


correct responses cues 
situation. 
is 
desired 


responses cue 


learning process 


one 
trial. 
that cue learning is not so much a 
matter of the formation of the 
selection of responses. The present 
theoretical discussion begins with a 


responses 


as 


set of strategies and is concerned with 


the mechanisms by which the subject 


might select out those strategies in- 
tended, consistently rewarded, 
by the experimenter. 
The “strategy” 
in a sense related to the more common 


and 


term is employed 
terms, “habit’’ and “hypothesis,” to 


designate a particular pattern of re- 

: by the 
Research Fellow 
Research Council, 


This research facilitated 
author’s tenure as Faculty 
of the Social 
1959-61. 


was 


Science 


329 


stimuli. Consider, for 
example, a rat being trained to jump 
to the larger of two white circles in a 
Lashley Jumping Stand. Some pos- 
sible strategies include; jumping to 
the left side, jumping to the right 
jumping alternately left and 
right, jumping always to the side last 
reinforced, jumping to the side which 
last was punished, jumping in the 
direction the rat was placed down on 
the stand, jumping toward the circle 
with less (or more) acute curvature, 
jumping toward the side with more 
white area, jumping toward the side 
which reflects more total light, and so 
forth. Each such pattern of behav- 
ior, if it occurs in isolation for a suffi- 
ciently long sequence of trials, can be 
identified by the experimenter. It 
will be seen that there are ordinarily 
a large number and variety of possible 
strategies in a cue learning problem, 
and that a number of strategies may 
be perfectly confounded. In the ex- 
ample above, jumping to the circle 
with less acute curvature will always 
lead to the same result as jumping to 
the side which reflects more light, 
since these will both be jumps to the 


sponses to 


side, 


larger circle. 


THREE MODELS OF THE 
SELECTION OF STRATEGIES 


The general idea developed here is 
that the problem gives rise to a set of 
strategies. The subject uses these 
various strategies, which at first are 
selected at random, and attempts to 
select strategies which will consist- 
ently lead to correct responses. For 
simplicity the discussion is limited to 
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problems in which there are at least 
some strategies which are always 
correct. ‘ 

Let H be the set of strategies avail- 
able to the subject at the beginning 
of the problem. Suppose that some 
subset C of these strategies are always 
correct, another subset W are always 
wrong, and the remainder, J, are 
sometimes correct and sometimes 
wrong. For simplicity of presenta- 
tion, begin with a problem in which 
the strategies of J are correct and 
wrong, on successive trials, with in- 
dependent probabilities all of which 
are 4, and restrict attention to the 
two-choice problem. 

Presentation of the theory of this 
conventional discrimination learning 
problem will begin with a special case 
in which one strategy is used on each 
trial. Then an alternative theory 
will be proposed, in which the subject 
uses all strategies at once and attempts 
to narrow down to the correct one. 
Third, a model will be proposed in 
which the subject chooses a random 
sample of all strategies and attempts 
to narrow down to the correct strate- 
gies within his sample. These three 
models will be shown to be essentially 
equivalent, in a theorem which may 
be called the “indifference to sample 
size’ theorem. Following proof of 
that theorem, some empirical impli- 
cations of the theory will be derived 
and compared with relevant data. 


One Strategy at a Time 


On the first trial of the experiment, 
the subject selects a certain single 
strategy from 7 at random and makes 
the indicated If the re- 
sponse is correct, the same strategy 
If it is again cor- 


response. 


is used on Trial 2. 
rect, it is used again on Trial 3, and 
soforth. If, on any trial, the response 
is incorrect, the subject returns his 
strategy to the set 7 and then chooses 


RESTLE 


It is 
same 


a strategy again at random. 
considered -possible that the 
strategy may be resampled, so that 
the sampling process is with replace- 
ment. Imagine that an error occurs 
on Trial 2. Then a new strategy is 
chosen and this strategy is used on 
Trial 3. If it is correct it is used 
again on Trial 4. If the response on 
Trial 3 is wrong, the subject again 
chooses an hypothesis at random for 
Trial 4. 

Notice that if the subject chooses a 
correct strategy, will make no 
more errors. The correct strategy 
leads to correct responses, and when 
the response is correct the subject 
does not change strategy. Thus, the 
learning process is terminated by the 
choice of a correct strategy. 


he 


Since sampling is with replacement, 
the probabilities of choosing a correct, 
a wrong, or an irrelevant strategy are 
constant from trial to trial. Let these 
three probabilities be c, w, and 7 
l—c-—w. 

At the beginning of training the 
subject chooses a strategy of one of 
the three types, which have probabili- 
ties c, w, and 7. After each error he 
also chooses a strategy, with prob- 
abilities c, w, and z. Thus an error is 
an event which returns the subject to 
exactly the same condition (in terms 
of the probabilities of future events) 
with which he started—it resets the 
process and makes it over. 
Such an event is called a “recurrent 
event” (Feller, 1950). Since our 
experimental interest centers around 
errors and correct responses, we can 
limit attention to errors and thus to 
the theory of recurrent events. 


begin 


Imagine that at some Trial the 


subject makes an error. There is a 
certain probability f; that the next 
error occurs at Trial » + 1, a prob- 
ability f. that the next error occurs at 
Trial n + 2, and in general a prob- 
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ability distribution f; that the next 
error after Trial m occurs at Trial 
n+j. In a model of recurrent 
events like the present one, the dis- 
tribution f; is the same for all Trials n 
on which an error occurs, and is the 
same as the distribution of probabili- 
ties that the first error occurs on 
Trial 1, 2, etc. 

The random process and all of its 
properties are specified by the dis- 
tribution f;. However, the analysis 
of the process from the distribution f; 
will be postponed until two steps have 
been completed; first, calculation of 
the distribution f ; for the one-strategy- 
at-a-time model, and second, the 
formulation of two other models, very 
different in assumptions, which also 
lead to systems of recurrent events 


with the same distribution. When 


these steps are completed, the general- 
ity of the f distribution will be appar- 
ent, and derivations of properties of 


the process will be justified. 

First, f; is the probability of an 
error immediately following another 
error. The second error can occur 
because a wrong strategy was chosen 
(with probability w) or because an 
irrelevant strategy was chosen and 
turned out to be wrong (with prob- 
ability 42). These are mutually ex- 
clusive events and exhaust the possi- 
bilities, so that: 


fi = wt 4. 


An error occuring on the second trial 
after the last one, with a correct re- 
sponse intervening, cannot result from 
the selection of a wrong strategy for 
the wrong strategy would not lead to 
an intervening correct response. If 
the error does occur on the second trial 
we know that the strategy chosen was 
not correct. Hence it must be an 
irrelevant one which led first to a 
correct and then to a wrong response. 
In simple discrimination learning, the 
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probability that an irrelevant strategy 
would lead to a correct and then a 
wrong response is 4-3 = (3). The 
probability of choosing an irrelevant 
strategy which gives this sequence, 
correct-wrong, is therefore : 


fr =i)" 


By the same reasoning, any value of 


f;, 7 > 2, implies that an irrelevant 


strategy was chosen and then led to 


j — 1 correct responses followed by a 


wrong response. The probability of 
a string of exactly 7 — 1 correct re- 
sponses each with probability 4, fol- 
lowed by an error with probability 
s, is (3)’. Hence, for 7 > 2: 


f; = 1(3)? 
Summarizing the results we have the 
distribution : 

fi=wth 
f; = i(3)? 
for all 7> 2. The distribution f; is 
not a proper probability function 
because it does not sum to unity. 
Notice that: 
D 


> (fp)=wti > ()i=wti=1—c 
j=1 


7=1 


In Feller’s terms this means that 
errors are uncertain recurrent events, 
for with probability c the subject 
chooses a correct strategy and never 
makes another error. Some of the 
probability of the f distribution is 
located at the (improper) point ~, 
and the proportion so located is c. 
This merely reflects the fact that 
learning occurs in the present model 
and the subject can eliminate errors. 
The fact that f; is not a probability 
function does not place any serious 
difficulties in the way of analysis. 


All Strategies at Once 


We now consider a second model 
of a subject working on the same dis- 
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crimination learning problem dis- 
cussed above, with exactly the same 
set of strategies 7 divided into correct 
(C), wrong (W), and irrelevant (/) 
subsets. Since a strategy corresponds 
to any specific cue in the situation 
and these cues are numerous and 
finely divisible, consider that the set 
of strategies consists of a fairly size- 
able (though finite) number of strate- 
gies, each of which is as likely to be 
used, and has as much influence on 
behavior, as every other. 

In the all-strategies-at-once model 
we imagine that the subject begins 
the learning process by considering 
the entire set 7/7 of strategies simul- 
taneously. In informal terms, he 
then attempts to narrow down to the 
correct strategy by a process of elim- 
ination. However, we suppose that 
the subject has only a limited memory 
and cannot remember all of the strate- 
gies at once, but only those with which 
he is presently concerned. The main 
disability of the subject in this model 
is that he may accidentally eliminate 
the correct When this 
happens he is unable to remember 
what they were, for he can remember 
only the ones he is still using. When 
the correct strategies have been lost 


strategies. 


the subject has no recourse but to 
whole set of 
This general idea is now 


begin over with the 
strategies. 
stated more precisely. 

On the first trial the subject con- 
siders all strategies and chooses his 
response. The correct strategies and 
some of the irrelevant ones lead to- 
ward a correct response, the wrong 
strategies and the remainder of the 
irrelevant ones lead to an error. We 
suppose that the probability of a re- 
sponse is equal to the proportion of 
the strategies lead to that 
response, so that, if we let N., Nw, 
and N; be the numbers of strategies 
W, and J, and if we let 


which 


in sets C, 


fi (Nw + Nayi)/(Ne + Nu 
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N= be the number of irrelevant 
strategies which lead to an error on 
the next trial, the probability of an 
error is: 


fi = (Nw + Ni-)/(No + Ne + Ni). 


When the subject makes this first 
response we suppose that he elimi- 
nates or sets aside all the strategies he 
does not use. Suppose, for example, 
that he chooses a correct response. 
He at the same time eliminates all 
the wrong strategies and the irrelevant 
strategies which would have led him 
to the opposite response. On the next 
trial he will have only the correct 
strategies and the surviving irrelevant 
strategies left. 

To complete the discussion and 
generalize it somewhat we introduce 
notation for the set of irrelevant 
strategies which, following any Trial 
n, first lead to a wrong response on 
Trial m + j7. Let N,,; be the number 
of irrelevant strategies which lead to 
responses on trials m + 1, 
n+2,. ,n + 7 — 1, and an error 
on trial +7. Now we can write 
the probability of an error after an 


correct 


error as: 
+ N;). 


After one correct response, the only 
strategies which will lead to an error 
Nn,2 irrelevent 
lead to an error on the 
second trial. The total strategies 
left are N. correct and N; — Na, 
irrelevant strategies which have not 
already led toward a wrong response. 
Thus, the probability of an error fol- 
which 


are those strategies 


which first 


lowing one correct response 
follows a given error is: 

fo= (1—f1)(Nn.2/LNe+tNi— Nn.) 
N.+Ni-— Nai Na.2 
N.tNwtNi Ne+Ni-Nav 
Extension of this line of argument 
shows that in general: 
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N. + Ni — Naa Ne + Ni — Naa - 
N.+ Ni -— 1 


N. + No + Ni 
Ne+Ni-Naa-...—Nnj-2- 
N.+Ni-— Naim... —Nejp-3- 


In this expression the numerator of 
each term except the last cancels with 
the denominator of the following 
term, so that: 


fi = Na.i/(Ne + No + Ni) 


In our ideal experiment, N,,;, the 
number of strategies which first lead 
to a wrong response on Trial ” + 7 
after a given error on Trial n, is in- 
dependent of m and is given by N;($)’. 
This statement follows from the idea 
that irrelevant strategies are correct 
and wrong with independent prob- 
abilities of $. Actually, N,;(4)? is the 
expected number of irrelevant strate- 
gies which first lead to an error on 
Trial » +7, but the derivation to 
follow does not hinge on the distinc- 
tion. 

If we substitute N;($)/ for N,,; in 
the expression for f; and simplify, 
letting 7 be the proportion of irrele- 
vant strategies in H/7, we obtain: 

f; = 4(4)4. 
A similar substitution in the expression 
for fi gives: 


fi=wt ht. 
These two results are in agreement 
with the one-strategy-at-a-time 
model. We have shown that the 
one-strategy-at-a-time and the all- 
models are both 
systems in which errors are recurrent 
events, and we have shown that f; is 
the same in both systems. Hence, so 
far as errors are concerned, the two 
models are identical. 

One remark is in order. In the one- 
strategy-at-a-time model, the proba- 
bility of a correct response on any 


strategies-at-once 


STRATEG 


IES IN CuE LEARNING 


N 
4¥n,2 


Nas 


Nn, j—1 
Na, j-—2 N. + N; as 


—Nu,; i— N; af 
trial is 1, 3, or 0 depending on whether 
a correct, irrelevant, or wrong strategy 
is being used. The probability of a 
correct response in the all-strategies- 
at-once model can take a variety 
of values; if there are a great many 
strategies, the possible values of this 
probability are numerous and closely 
packed. The two models are not 
at all alike in the probabilities in- 
volved, and the theorem that they 
give indistinguishable sequences of 
errors is by no means obvious on 
intuitive grounds, even though the 
proof is easy. 

The correspondence of the models 
extends to the interpretation of the 
parameters. In the one-strategy-at- 
a-time model the parameters c, w, 
and 7 are the probabilities of selecting 
a correct, wrong, or irrelevant strat- 
egy. In the  all-strategies-at-once 
model the parameters c, w, and 7 are 
the proportions of correct, wrong, and 
irrelevant strategies, where elemen- 
tary strategies are taken to have equal 
weight. In terms of experimental 
realizations of the models, these two 
sets of parameters are not distinguish- 
able. 

It is still possible, of course, to tell 
which model is more reasonable by 
any technique which goes beyond the 
choice data. One might, for example, 
ask human subjects which strategies 
they are using, or one might use 
memory tests to see what parts of the 
situation have been employed. The 
equivalence of the models is asserted 
only for the acquisition of a single 
problem, and only with respect to 
the overt choices made: the writer will 
suggest, in subsequent work, that rec- 
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ognition and transfer tests may per- 
mit differential predictions. 


A Random Sample of Strategies 


The two models discussed 
are extremes. One would intuitively 
imagine that the subject would use, 
not all strategies and not just one, 
but some sample of strategies. The 
third model, called the random-sam- 
ple-of-strategies model, supposes that 
on the first trial and after each error 
the subject draws a random sample 
of all strategies. So long as he makes 
no errors the subject discards wrong 
and irrelevant strategies by the same 
mechanism as that invoked in the all- 
strategies-at-once model. If the sub- 
ject makes an error he takes a new 
independent sample from H/ and be- 
gins the process over. 

The theorem to be proved is, if the 
sampling process is random in a strict 


above 


sense to be described below and ex- 
cludes the possibility that the sub- 
ject might choose the empty sample, 
then the random-sample-of-strategies 
model generates a system of recurrent 
events which is the same as the other 
two models given above. 

We again consider the set 7 of 
strategies, composed of equally potent 
elementary strategies. Suppose that 
there are N strategies in H of which 
N. are correct, N, are wrong, and 
N, are irrelevant. We shall interpret 
N./N as c, the proportion of correct 
strategies, N,,/N as w, and N,/N as 1. 

In a random sampling model of 
learning one may assign a fixed prob- 
ability of being sampled to each ele- 
ment and let the sample size vary 
randomly, or assign a fixed sample- 
size, supposing that all such samples 
are equally probable (Estes, 1959.) 
To eliminate the possibility of the 
empty sample it is desirable to deal 
with a fixed sample size nm. The 
random-sampling specification is that 
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all samples of size m are equally prob- 
able. 

The total number of samples of size 
n which can be drawn from a popula- 
tion of N elements is: 


N! 


(") =a 
n} n'(N—n)! 


~* . N. 
Similarly, one can draw sam- 
Nn. 


ples of m, correct strategies from the 
population of N, strategies. For each 
of these samples of n, correct strategies, 


a 


there are samples of m. wrong 


: Ne\f Nw 
strategies, so that there are c n 


ways of drawing m, correct and my 
wrong strategies. For each of these 


N; 
combinations there are ( ways 
n; 


of drawing n; irrelevant strategies. 
Hence the total number of ways one 
could obtain 7, correct, 2, wrong, and 
n; irrelevant strategies is: 


( N.\f Nw \f Ni 
nN. Be n; 
where, of course, it is assumed that 
Ne + Nye +n; = n. Since every sam- 
ple of size m is assumed to be as likely 
as any other, the probability of draw- 
ing a sample of exactly m, correct, 
Nn» wrong, and n; irrelevant strate- 
Nw 


N. N; 
n-J\ Nw n; 
N 
( n ) 

which is the three-category generaliza- 
tion of the hypergeometric distribu- 
tion (Feller, 1950). 

It was asserted, in the statement of 
the model, that once the subject has 
chosen a sample his performance 


follows the rules of the all-strategies- 
at-once model. Hence, after any 


gies is: 


P(n., Re, %i) = 
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error and the choice of a sample with 
Ne, Nw, and n; strategies, we can con- 
struct the f; distribution. We may 
write the f distribution conditional on 
the constitution of the sample as: 


f(j|sample with n,, nw», n; strategies) 


forj7 = 1 
forj > 1 


| (Mw + 3n,)/n 
| (4)in,/n 


directly from our results on the all- 
strategies-at-once model. To obtain 
the unconditional f; we multiply each 
conditional f by the probability of the 
This gives us the 
conditional ff. 


sample, and sum. 
expectation of the 
For f; we have: 
fy E[ (". + }n;)/n ] 

E(n,./n) + 3E(n;/n) 
However, in the hypergeometric dis- 
tribution the proportion of 
wrong strategies in samples of size n 


mean 


is just the proportion of wrong strate- 
gies in the whole population. That 
is, E(n,/n) = N,/N =w. Similar 
equations hold for irrelevant and cor- 


rect strategies. Hence: 


yi l . Ai 
and, by the same argument: 

f; (4)% 
for j greater than 1. Thus it is shown 
that the random-sample-of-strategies 
model with fixed sample size n is a 
system of recurrent events (this point 
is obvious and is not proved above) 
with the same f distribution found in 
the one-strategy-at-a-time and the all- 
strategies-at-once models. At this 
point one can see that the one-strat- 
egy-at-a-time model is a special case 
of the random-sample-of-strategies 
model with sample size n 1; and 
similarly, the  all-strategies-at-once 
model is a special case with sample 

sizen = N. 

One further generalization is possi- 
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ble. Since the final equations of the 
random-sample-of-strategies model 
contain no reference to the sample 
size m, one would get a system of re- 
current events with the given f dis- 
tribution from any mixture of systems 
which use different sample sizes. 
One can withdraw the assumption 
that the sample size is fixed, permitting 
the possibility that samples are of all 
different sizes. The only restrictions 
are that for any , all samples of size 
n are equally likely; and there are no 
empty samples. 

The strategy-sampling model differs 
from Estes’ stimulus-sampling model 
(Estes, 1959) in an important respect. 
Let the total number of elements 
(strategies or, in Estes’ case, stimulus 
elements or patterns) be N and the 
sample size be n. In Estes’ formula- 
tions, #/N is analogous to the learning 
rate 6 and controls the rate of learning 
and all the statistical characteristics 
of the acquisition data. In the pres- 
ent model, learning is independent of 
the sample size n/N and depends 
solely on the composition of the basic 
set JJ; namely, the proportions of 
correct, wrong, and irrelevant strate- 
gies in 77. Thus while the sampling 
process invoked in this strategy- 
sampling theory is one of those used 
by Estes, its place in the theory is 
entirely different. 

In this section it has been shown 
that three models of the selection of 
strategies all lead to the same system 
of recurrent events. Whether or not 
this is of interest depends largely on 
whether the resulting description fits 
the data of cue learning experiments, 
a question which is considered in the 
remainder of the paper. 


STATISTICAL PROPERTIES Ol 
THE Raw Data 


The discussion above has been re- 
stricted to the distribution f;, the 
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probability that an error follows the 
last error by exactly j trials. The f 
distribution can be estimated directly 
from raw data, but this is neither a 
conventional nor a very interesting 
way of describing the data of cue 
learning. In this section the data 
generated by this model are analyzed 
in several of the ways commonly em- 
ployed by experimenters. Details are 
given in Restle (1961). 

Learning Curve(s) 
three versions of the 
One is the succession 


Consider 
learning curve. 
of correct and wrong responses by an 
individual subject. A second is the 
average learning curve of a group of 
subjects. The third is a corrected or 
idealized form of the learning curve 
computed by adjusting a group learn- 
ing curve (Vincentizing) or by averag- 
ing the data of subjects who are 
selected after the fact for similarity 
of over-all performance (Spence, 
1956). 

According to the present theory, 
the individual data are composed of a 
wrong re- 
sponses in irregular order, followed 
by an infinite correct 
responses. If w > 0, will be 
errors correct 


sequence of correct and 
sequence of 
there 
somewhat than 
responses on the resolution trials, and 


more 


there will be a tendency for errors to 
follow other errors more often than 
errors follow correct responses. The 
probability of an error following an 
error will be (w + 47)/(w + 7) where- 
as the probability of an error following 
a correct response is}. An individual 
subject will produce below- 
chance behavior for block of 
trials and then abruptly, after some 
error, will either happen on a correct 
strategy (in the one-strategy-at-a- 
time model) or will begin a process 
of elimination which ends up with all 
‘his is an extreme 


such 
some 


correct strategies. 


form of the “discontinuous” or 
insightful learning curve. Unfortu- 
nately it is difficult to decide whether 
any individual subject does or does 
not exhibit this pattern, so data are 
usually combined. 

The group learning curve is merely 
the average of a set of (theoretically) 
discontinuous individual curves. If 
all subjects have the same parameters 
c, w, and 2, they will nevertheless 
happen to master the problem at 
different trials, so that the average 
learning curve is gradual. Its mathe- 
matical form is complex but in general 
appearance, the group learning from 
this theory resembles the 
“orowth” curve (Restle, in 1961). 


common 


If one selects, from a larger group of 
subjects, those who make the same 
number of total errors, or those who 
reach criterion at the same time, and 
averages their learning curves; or if 
one rescales learning trials as by the 
Vincent-curve method, the resulting 
curve will usually be S shaped. The 
writer has investigated this question 
by generating data which arise di- 
rectly from the assumptions of the 
use of tables of 
random When, from a 
large set of such data, one selects a 


present model, by 
numbers. 


subgroup of “‘subjects” who all make 


the same number of errors, or who 
reach criterion at about the 
trial, and average the performance 
within such a subgroup, the result is 
an S shaped curve. Exact Vincen- 
tizing produces a flat (stationary) 
learning curve before criterion. 


same 


The reason for the S shaped curves 
is not difficult to find; the above 
methods of selecting or rearranging 
the data tend to put the (theoretically 
random) times of solution close to- 
gether. If the times of solution are 
put exactly together a step-function 
should result, but if the times of solu- 
tion are only grouped close to one 
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another, the step is blurred and an 
S shaped curve results (Spence, 1956). 


Summary Statistics of the Data 


In the present theory the actual 
trial on which learning takes place is 
not in any way fixed, but depends 
upon the random outcome of the 
process of selecting strategies. The 
result is large intrinsic variance in the 
acquisition phase, which can most 
conveniently be described in terms 
of the total errors made by each sub- 
ject. If each subject is trained until 
a long sequence of consecutive correct 
responses has been obtained, one is 
reasonably sure that the number of 
errors made approximates the theoret- 
ical total errors. 

On the first trial or after any error 
the subject may either make another 
error, sometime later, or he may make 
no more errors. The probability that 
the subject will make at least one 
more error, the first one in exactly 7 
trials, is f;. Thus the total probabil- 
ity of at least one more error is: 


With probability c the subject never 
makes another error at all. From 
this it is not difficult to show that the 
probability of exactly k errors is 
(1 — c)'c. This is the geometric dis- 
tribution which has mean: 


E(k) (1 — c)/e 


and variance: 


9 


Var (k) (1 — c)/¢e 
The standard deviation of the distribu- 
tion of error scores should be nearly 
equal to its mean, according to the 
theory, and the distribution should 
show an extreme positive skewness. 
Provided that irrelevant strategies 
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are correct just half the time at ran- 
dom, trials-to-criterion behaves very 
much like total errors. 


Methods of Estimating the Parameters 
w and c 


An important step in any mathemat- 
ical development of a learning theory 
is the estimation of parameters. 
Fortunately, in the present model 
quite simple and efficient estimates 
are available. A maximum-likelihood 
estimator of c is given by: 


c= 1/(T + 1) 


where T is the mean total error score 
of a group of subjects. The variance 
of this estimate, with NV subjects in the 
group, is: 
Var (é) ce(1 —c)/N 

It is also possible to estimate w by the 
maximum-likelihood method. As 
was mentioned earlier, a high fre- 
quency of consecutive errors in the 
presolution phase is an indication of a 
relatively large proportion of wrong 
strategies, whereas a chance frequency 
of consecutive errors is an indication 
that there are relatively few wrong 
strategies; provided that the irrele- 
vant strategies are correct and wrong 
strictly at random. The method is 
to count “Trial 0,” an imaginary trial 
before training begins, as an error. 
Then for each subject divide this ex- 
panded set of errors into Mo errors 
which are followed by correct re- 
sponses and M, errors which are fol- 
lowed by errors. Then computing 
the means of these statistics one has: 


w= 


Mm- M,+1 


M, +. M, 


as a maximum-likelihood estimator of 
w (Restle, 1961). 
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SOME VARIATIONS OF THE 
Basic MopEL 


The theory stated above is so re- 
strictive in its assumptions that it 
cannot be applied with success to a 
great many experimental data. For 
example, with the theory above, one 
must expect the standard deviation of 
error scores to be just slightly less 
than their mean, and this is not ob- 
served with any great regularity. 
Furthermore, animal subjects often 
show a strong tendency to remain 
with one strategy even if it is wrong 
or irrelevant, for a fairly long string 
of trials. The model given above 
says that the subject starts over after 
each error, and this simply is not a 
plausible assumption for animal 
studies.’ 


Perseveration 


The model is quite easily general- 


ized to take some account of persever- 
In place of the 
assumption that the subject always 
resamples after an error, one supposes 


ation of strategies. 


that resampling occurs with 
probability r, which may be consid- 
ered a constant for given experimental 
conditions and also a constant during 
the presolution period. The subject 
chooses some strategy or sample of 
strategies at Trial 1, whence the prob- 
ability of making zero errors is c. 
The probability that any actual error 
is the last one is only rc, the joint 
probability that (a) the subject re- 
(6) having resampled, 
hits on a correct strategy (in the one- 
strategy-at-a-time model) or goes into 
a terminal process of narrowing down 
(in the all-strategies-at-once or the 
random-sample-of-strategies models.) 
For this model with perseveration the 


some 


samples and 


? The writer is indebted to Marvin Levine 
for pointing out the importance of this factor 
in anima! discrimination learning. 


distribution of total error scores is: 
P(0O) =c 
P(T) = (1 — oc) (A — re)? "re. 


where JT > 0. The value of this 
model in fitting animal data is illus- 
trated by analyzing a set of error 
scores reported in detail by Harlow, 
Harlow, Rueping, and Mason (1960), 
(See Restle, 1960). Harlow’s distri- 
butions show a relatively high fre- 
quency of solutions with zero errors, 
along with a spread to very large 
error scores, in original learning by 
baby monkeys. The result is that 
the standard deviations of the distri- 
butions are somewhat larger than the 
means. Harlow et al. remarked on 
the high degree of perseveration shown 
by some monkeys. Even more strik- 
ing is Warren's investigation (Warren, 
1959b) of the discrimination learning 
of cats, which show strong persevera- 
tive tendencies and which give distri- 
butions of error scores with standard 
deviations much larger than the corre- 
sponding means. 

It is not at all sure that persevera- 
tive effects can be reduced to the 
single parameter r, and the writer’s 
investigations are entirely insufficient 
to support the assumption. Within 
the model there are several symptoms 
of perseveration; a high frequency of 
zero error scores relative to the re- 
mainder of the distribution, higher 
standard deviation than mean errors, 
and consecutive runs of responses 
which follow a wrong or irrelevant 
strategy, such as a position habit, ob- 
ject preferance, etc. One can imagine 
that perseverative tendencies would 
be strong whenever the incentive for 
learning is not entirely effective, or 
when the subjects (perhaps from par- 
tial reinforcement) have developed 
an expectation that rewards cannot 
always be attained. Something of 
this last effect, in college students, is 
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suggested by the results of Morin 


(1955). 


Complexity of Correct Strategies 


In the model above it is tacitly 
assumed that the problem can _ be 
solved on the basis of a single correct 
strategy. This seems a reasonable 
assumption in the case of simultane- 
ous discrimination. However, when 
two situations are presented on suc- 
cessive trials it is a question whether 
complete solution involves learning 
one strategy (turn right to white and 
left to black) or two strategies, one 
for each situation. If two strategies 
are learned independently and with 
the same parameters, then the dis- 
tribution of errors is the distribution 
of the sum of two random variables 
(total errors in each situation) each 
of which has a geometric distribution. 
This results in a negative binomial 
distribution (see Restle, 1961) with 
mean: 


E(7) 2(1 — c)/e 


and variance: 
Var(T) = 2(1 — c)/e. 


These statements follow at once from 
the fact that the mean of the sum of 
two independent variables is the sum 
of the means, and the variance of the 
sum is the sum of the component 
variances. With small c the standard 
deviation will be about 1/v2 or about 
0.7 of the mean. 

An interesting fact can be brought 
forward in favor of this argument, 
though the writer has not been able 
to make an exhaustive test. In a 
case of successive size discrimination 
by college students (Restle, 1955) 
the writer found ratios of standard 
deviation/mean of 1.1 and 0.9 for two 
problems—both very close to unity, 
hence in agreement with the orig- 
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inal model and with the notion that 
a single strategy is sufficient to solve 
the problem. In a comparable study 
using rats in a successive black-white 
discrimination, Amsel (1952) obtained 
data on three groups in which the 
ratio of standard deviation to mean 
was 0.65, 0.68, and 0.72. The rat re- 
sults are in excellent agreement with 
the hypothesis that the rats had to 
use two separate strategies to solve 
the problem. One might guess that 
rats do not integrate the two habits, 
right turn to white and left turn to 
black, into a single cognitive struc- 
ture, whereas humans might make 
just such an integration. The present 
analysis may be useful in throwing 
further light on the nature of suc- 
cessive discriminations for various 
species of subjects. 


ADDITIVITY OF CUES 


In several papers regarding another 
theory the writer has discussed the 
additivity of cues (Restle, 1955, 1957, 
1958, 1959a; see also Bourne & Restle, 
1959, and Trabasso, 1960). In simple 
terms the experiment involves three 
groups; one learns a problem based 
on a set A of cues, the second learns 
a problem based on the set B of cues, 
and the third learns a problem which 
can be solved using either A or B 
cues disjunctively. If the sets A and 
B are separate then the third set 
A U Bshould have measure m(A LU B) 

= m(A) + m(B), (see Trabasso, 1960). 
Experimental results have been recon- 
ciled with an S-R theory involving 
adaptation of irrelevant cues (Restle, 
1955). 

Generally speaking, the calculations 
from the present model are in good 
numerical agreement with those from 
the ‘‘adaptation’” model. Reanaly- 
sis of three of the most satisfactory 
sets of data used before, and one set 
not previously used, are reported 
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here. The other data previously dis- 
cussed are so fragmentary that analy- 
sis will hardly be fruitful; and Tra- 
basso (1960) used such difficult prob- 
lems that many of his subjects failed 
to learn at all on the more difficult 
problems, making analysis by the 
present model unsatisfactory. In 
general, the estimates reported below 
are not the maximum-likelihood esti- 
mates because the subjects were not 
run to a strong criterion. However, 
since learning was nearly complete, 
the approximate estimates are ade- 
quate. 

Scharlock (1955; see also Restle, 
1957) ran a_ place-versus-response 
experiment with rats in which the 
relative weight of correct place (extra- 
maze) and correct response (intra- 
maze) strategies can be estimated. 
He also ran one group with both place 
and response strategies correct, and a 
group with response strategies correct 
and no place cues present. In the 
calculations below it is assumed that 
the number or wrong strategies of a 
given type (place or response) equals 
the number of correct strategies. 
This is asymmetry assumption which, 
while not strictly appropriate for 
Scharlock’s experiment, is needed to 
permit prediction, since separate esti- 
mates of wrong strategies cannot be 
made on the available data. 

Rats learned to go to the same place 
(using extra-maze cues), with response 
cues irrelevant, making an average of 
9.7 errors. Using the estimate ¢ = 
1/(1 + T), one estimates that the cor- 
rect place strategies make up 1/10.7 
= .093 of the total set of strategies. 
Other rats learned to make a constant 
response to different places, averag- 
ing 6.7 errors. We estimate that cor- 
rect response strategies constitute 
1/7.7 = .130 of the total set. 


The simplest interpretation of the 
experiment is that when both place 
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and response strategies would work, 
the proportion of correct strategies 
would be the sum of the proportions 
of place and response strategies, since 
the same total set of strategies is 
available. One predicts that with 
place-plus-response learning, c = .093 
+ .130 = .223. We have the formula 
that the expected errors to solution is 
(1 — c)/c. Hence the expected mean 
errors is .777/.223 = 3.4; the obtained 
mean was 4.0, which is adequately 
close. The discrepancy can partly be 
explained by the fact that the place- 
learning and response-learning groups 
were not run to a high criterion and 
probably would have made more 
errors if tested longer. The fast- 
learning place-plus-response group 
would likely not have made more 
errors, since their performance was ex- 
cellent at the end of the training given. 

Another group learned a fixed re- 
sponse in the absence of any good 
place cues. We assume that this 
group simply had neither correct nor 
incorrect place strategies. Its pre- 
dicted proportion of correct strategies 
is .130/1 — 2(.093) = .130/.814 
.160. The expected errors is .840/ 
.160 = 5.25: the observed mean was 
5.0. Both predictions are close to 
the obtained results, well within 
sampling deviations. 

Warren (1959a) has reported data 
on monkeys in an experiment analo- 
gous to Scharlock’s study of rats. 
Warren’s monkeys had to learn posi- 
tion habits in some problems and ob- 
ject-discriminations in other prob- 
lems. Warren also used a response- 
plus-object problem (object discrim- 
ination with the objects left in the 
same place each trial) and a pure re- 
sponse problem (e.g., choose the left 
one of two indistinguishable objects.) 
Predictions followed the same formu- 
las as above, and were extremely ac- 
curate: the proportions of object and 
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were estimated 
from behavior on object (position- 
varied) and position (object-varied) 
problems. When these values were 
added to predict object-plus-position, 
the prediction was a mean of 0.67 
errors, whereas Warren observed 0.63. 
For pure response learning, the model 
predicted 3.25 mean errors and the 
observed value was 3.04. These 
agreements between theory and data 
are well within the range of probable 
sampling error. 


response strategies 


Similar results were obtained by 
analyzing Warren's experiment on the 
additivity of color, form, and 
cues (Warren, 1953). The analysis is 
substantially the same as that given 
in a previous paper (Restle, 1958). 
Learning data are collected on prob- 
lems involving only color, only form, 
or only size differences between the 
objects. 


size 


These data are used to cal- 
culate the proportional weights of the 
three sources of strategies, and the 
resulting values 
compute predicted learning rates for 
problems involving two or more di- 
mensions; for example, discrimina- 
tion of a red triangle from a green 
circle involves color plus form cues. 
In each calculation account is taken 
of the greater total number of strate- 
gies involved in problems with added 
cues, and it is assumed that whenever 


are recombined to 


a set of correct strategies is intro- 


duced by adding cues, an equally 


large set of wrong strategies also enter 
the situation. 
Warren’s (1953) 
data afforded four predictions of total 
errors to solution. The four predic- 
tions were wrong by —17, 0, 5 and 10 
percent respectively, and all of the 
errors can reasonably be attributed 
to sampling variations. 

The additivity of 
cues has also been applied to human 
learning (Restle, 1959a) in an experi- 
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ment which required subjects to learn 
differential verbal responses to con- 
sonant syllables. The procedure was 
simple concept formation with indi- 
vidual letters of the syllables used as 
cues, and the same sort of additivity 
of cues as above. The adaptation 
model gave quite accurate predictions 
and the present model is, if anything, 
slightly more accurate; it predicts 
(given data on the two cues separately) 
that the added-cue group should make 
an average of 5.26 errors, whereas the 
observed mean was 5.25. Consider- 
ing the variability of the data, the 
extreme closeness is coincidental. 


COMPARISON WITH OTHER THEORIES 


The three models of the selection of 
strategies are conceptually similar to 
the writer’s adaptation theory of dis- 
crimination learning (Restle, 1955). 
The “strategies” of the present model 
resemble the ‘‘cues’” of the earlier 
theory. The strategy-selection model 
has, as a theorem, that the rate of 
learning depends upon the proportion 
of correct strategies. A similar idea 
is expressed in the adaptation model 
in which it was assumed that the rate 
of learning (@) would depend upon 
the proportion of relevant cues. 

Several of the serious faults of the 
adaptation theory are corrected in the 
theory of strategy-selection. First, 
in the adaptation theory the subject 
was supposed to begin conditioning 
relevant cues and adapting irrelevant 
cues, with a rate @ equal to the pro- 
portion of relevant cues, right from 
the first trial of training. Of course, 
at that first trial the subject has no 
possible way of knowing which cues 
will turn out to be relevant—it would 
be possible for the experimenter to 
change his mind after the first trial 
is complete. Hence, the adaptation 
theory could apply only to a prescient 
subject. The selection-of-strategies 
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theory does not have the subject 
treat different strategies differently 
except on the basis of trials already 
completed, hence avoids the absurd- 
ity. Second, the idea that the struc- 
ture of the problem (proportion of 
relevant cues or correct strategies) 
controls the rate of learning was only 
a simplifying assumption, with no 
justification, in the adaptation model; 
but is an inescapable theorem of 
strategy-selection. Third, the ad- 
aptation theory yielded a determinate 
learning curve in the sense that p() 
was exactly specified for a given @ and 
p(1). The variance of typical data 
was in large part left unexplained by 
the adaptation model and had to be 
attributed to individual differences, 
even though it is notoriously difficult 
to find any strong predictor of such 
learning. In the _ strategy-selection 
model learning itself is a random 


event and the model generates vari- 
ability comparable with that obtained 


in the data. 

Despite these important differ- 
ences, the present theory is close 
enough to the adaptation theory to 
make it possible to carry over many 
of the theoretical insights, though in 
modified form. The one type of pre- 
diction discussed in this paper was 
additivity of cues, but other ideas 
such as the proposed basis of learning 
sets (Restle, 1958) and various quan- 
titative relationships in concept identi- 
fication (Bourne & Restle, 1959) can 
be recast in the mold of the selection 
of strategies. The many similarities 
and few differences in predictions 
must be studied in detail and cannot 
be discussed here, except to mention 
that most of the results of the Bourne 
and Restle paper can be reproduced 
using the strategy-selection model. 

In more general terms, it may be 
remarked that the strategy-selection 
model is similar to the ideas of Lash- 
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ley (1928) and Krechevsky (1932) in 
general intent. In comparison with 
stimulus-sampling theoreies (Restle, 
1959b) the strategy-selection model is 
like theories of the observing re- 
sponse (Atkinson, 1959a, 1959b; 
Wyckoff, 1952), except that in the 
strategy-selection model there is no 
conditioning, only observing (select- 
ing). The mathematical structure of 
the model is very close to that of 
Bower’s (1960) one-element associa- 
tion model. 


SUMMARY 


A stochastic model for the solution 
of cue learning problems by the selec- 
tion of strategies was stated and de- 
veloped. Errors were shown to con- 
stitute a system of uncertain recur- 
rent events in Feller’s sense. Three 
models, one-strategy-at-a-time, all- 
strategies-at-once, and  a-random- 
sample-of-strategies, were formulated 
and shown to yield the same system 
of recurrent events, to be identical 
in terms of data. This is the in- 
dependence-of-sample-size theorem. 
The basic distribution of the recur- 
rent-events system was used to gener- 
ate a description of the data, and also 
a method for estimating the propor- 
tions of correct and wrong strategies 
in a problem. Variations of the 
model, which take account of per- 
severation and complex strategies, 
were indicated. Experimental evi- 
dence, mainly on the additivity of 
cues, was discussed. 
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This paper is concerned with organ- 
ization as a dependent variable in free 
recall verbal learning. A method will 
be described for measuring the extent 
to which subjects’ recall of verbal 
items presented in different orders on 
successive trials structured 
quentially. The method assumes no 
knowledge on the experimenter’s part 
as to the sources of organization, and 
it is thus applicable to free recall 
learning of any list of verbal items. 
The effect of repetition on organiza- 
tion, and the relation of organization 
to amount of recall will be briefly 
examined in a simple experiment. 
The method and the data are dis- 
cussed in terms of the problem of the 


is se- 


role of repetition in free recall verbal 


learning. 


VERBAL LEARNING AND 
ORGANIZATION 


The fact that learning occurs under 
conditions of practice, often appa- 
rently through mere repetition of the 
material, is empirically well known, 
but theoretically not yet fully under- 
stood. Indeed, the question of why 
repetition leads to better recall has 
not been raised too often explicitly, 
although it has been implicit in much 
experimental and theoretical work. 

The apparent lack of explicit inter- 
est in this problem seems to be related 
to the restricted aspects of subjects’ 
behavior which are studied in verbal 
learning experiments. In a_ large 
majority of studies the dependent 
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variables have been based on the 
operation of counting single unordered 
responses, assigned to categories such 
as “correct,” ‘‘intralist intrusion,”’ 
“remote backward association,” and 
the like. Given but a single basic 
response variable of this kind, there 
is relatively little that can be done 
with the question of why repetition 
is effective, other than studying the 
effect of different independent vari- 
ables on the rate of learning. One 
can be justifiably sceptical, however, 
of our chances of ever understanding 
the basic process of verbal learning 
by simply counting “correct” and 
various classes of “incorrect” re- 
sponses under a variety of experi- 
mental conditions. Bolles’ (1959) re- 
cent suggestion that, “If we are to 
understand human learning, we must 
determine what our Ss are really 
doing’ (p. 580), can be regarded as 
an example of growing realization that 
a critical reappraisal of response vari- 
ables in verbal learning is overdue. 
An important contribution to our 
understanding of the basic acquisition 
process might well be provided by 
response variables based on ordered 
classifications of data (Miller & 
Frick, 1949). Such variables are 
particularly interesting in the light of 
G. A. Miller’s (1956a, 1956b, 1956c), 
conception of verbal learning as a 
direct consequence of the process of re- 
coding or organization. Rehearsal or 
repetition, according to Miller, 1956b), 
has “the very important effect of or- 
ganizing many separate items into 
a single unit” (p. 43). Repetition 
does not change the basic storage 
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capacity of memory. Rather, organ- 
izing processes accompanying repeti- 
tion lead to an apparent increase in 
this capacity by increasing the in- 
formation load of individual units. 
Miller’s unitization hypothesis, if 
found tenable, has important impli- 
cations for research strategy and 
theory in verbal learning. One of 
the difficulties in evaluating the hy- 
pothesis lies in the lack of appropriate 
measures of organization. Organiza- 
tion as a response variable has been 
investigated by many psychologists 
(e.g., Bousfield, 1953; Cofer, 1959; 
Jenkins & Russell, 1952; Rothkopf 
& Coke, 1961), but its measurement 
has always depended on the experi- 
menter’s knowledge of sources of or- 
ganization present in the stimulus list. 
It is well known, however, that sub- 
jects are quite capable of memorizing 
materials which are not organized in 
any obvious manner, or for which the 
sources of organization cannot be 
readily specified. Thus, if the con- 
cept of organization is to have more 
general applicability, it seems neces- 
sary to develop methods that permit 
observation and quantification of or- 
ganization as a response measure in- 
dependent of the characteristics of the 
This is what the 
present paper attempts to do. 


stimulus material. 

Behavioral manifestations of the 
hypothesized organizing process can 
best be studied under conditions where 
the order in which the subject recalls 
Items which 
are organized into a single unit would 


items is free to vary. 


then be expected to occur in close 
temporal contiguity in subject’s re- 
call. Repeated occurrences of such 
sequential patterns would indicate the 
existence of the organizing process, 
and the extent to which this occurs 
would provide an estimate of the 
degree of organization. Thus a quan- 
titative analysis of sequential depend- 
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encies among items in subject’s free 
recall on successive trials would yield 
measures of a response variable closely 
related to the hypothetical organizing 
process, independently of quantity of 
recall. 

The method proposed in this paper 
is derived from information theory 
(Shannon & Weaver, 1949), and is 
quite similar to Miller and Frick’s 
(1949) “index of behavioral stereo- 
typy.”’ It provides for a measure of 
sequential redundancy in repeated 
ordered samples of a set of items. As 
the experimental paradigm used here 
involves presentation of material com- 
pletely free from any sequential re- 
dundancy, such redundancy or or- 
ganization in subject’s recall cannot 
be attributed to the input material. 
Rather, it is the subject who imposes 
a certain degree of organization on the 
material. For this reason such or- 
ganization is called subjective organ- 
ization (SO). In information theory 
terms, SO is ‘“‘noise’’ produced by the 
channel, that is the subject; it is in- 
formation in the output not found in 
the input (Miller, 1953). 


MEASUREMENT OF SUBJECTIVE 
ORGANIZATION 


Suppose we select ZL words, any 
words, and construct Z different or- 
ders of these words such that each 
word appears in each of the Z serial 
positions once and is preceded and 
followed by each other word just 
once. This can be done easily if 
L +41 isa prime number. We then 
represent each of these Z ordered lists 
to the subject on L separate trials. 
Words are shown one at a time on the 
memory drum. After each trial the 
subject is asked to record all the words 
that he remembers from the list, in any 
order he wishes. After all Z ordered 
lists have thus been presented we have 
L. recall records from the subject, one 
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TABLE 1 


A SAMPLE RECALL MATRIX FOR AN INDIVIDUAL SUBJECT 


nth word 


Nee ee 


nj; 1412 
Note.—Recall is pooled over 16 trials. 


for each trial. We then tabulate the 
frequency of all pairs of adjacent recall 
responses in a matrix consisting of 
L +1 rows and L + 1 columns, and 
compute the measure of SO from the 
data in the matrix. 

Table 1 shows a sample recall (out- 
put) matrix for a case where L = 16 
and where the subjects recall re- 
sponses are pooled for all 16 trials. 
Symbols x1, Xe, vie identifying 
the rows refer to the 1st, 2nd, . 
16th word in the mth position in the 
subject’s recall list; symbols y:, ye 

yie identifying the columns stand 
for the same words in the (m + 1)th 
position in the subject’s recall list. 
Thus x and y 
positions in the subject’s recall, and 
subscripts identify the contents of 
these positions, i.e., words. Symbol 
xo stands for the blank position (no 
word) immediately preceding the first 
word in the subject’s recall list, and 


represent successive 


(nm +1)th word 


| 14 


yo refers to the blank position im- 
mediately following the last word in 
the same list. The entries in the 
cells of the matrix show the frequency 
with which word i was followed by 
word j in the subject’s recall on all 16 
trials; their numerical values are 
symbolized as n,; Entries in the 
row labelled xo show the frequencies 
with which each word j stood at the 
beginning of the subject’s recall lists 
on 16 trials, and similarly entries in 
the column yo show frequencies with 
which each word i was in the last 
position of the recall lists on these 
trials. Zeros have been omitted in 
the matrix. For instance, mos = 3, 
which means that Word Number 6 
was in the first position (followed no 
other word) in the subject’s recall on 
three trials; m;3 = 0, indicating that 
Word Number 3 never followed Word 
Number 1 during the 16 trials, and so 
forth. The numerical values of mar- 
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ginal totals of rows, ;, show how 
many times each word 7 appeared in 
the subject’s recall on all 16 trials. 
Marginal totals for corresponding 
rows and columns are of course identi- 
cal. It is also to be noted that there 
are no entries in the cells along the 
main diagonal, from upper left to 
lower right, since normally no word 
follows itself in recall. 

Given a recall matrix such as the 
one in Table 1, a number of different 
measures of second-order sequential 
organization could be used. The 
measure of sequential organization 
adopted in the present paper is essenti- 
ally a measure of redundancy in a 
sequence of events when the probabil- 
ities of successive pairs of events are 
known or can be estimated: 


Che wn’ HT.(y) 
“\y) max H,(y) 


[1] 


IT,(y) here refers to the amount of 
information (uncertainty) in the event 
y when the preceding event x is known, 
and max H,(y) stands for the maxi- 
mum value of such _ uncertainty. 
This is Miller and Frick’s (1949) 
second-order index of behavioral stere- 
The formula for computing 
involves a minor 
modification of the above formula,’ 
simplifying calculations in addition 


otypy. 
second-order SO 


* Max H,(y) in Miller and Frick’s index of 
behavioral stereotypy is max 
H(x,y) — max H(x). Thus it is based on 
the assumption that all responses are equally 
frequent. For the present problem, however, 
it may be somewhat more meaningful to com- 
pute max’ H,(y) on the basis of the given fre- 
quency of individual responses, i.e., as deter- 
mined by subject’s recall: 


defined as 


max’ H,(y) = max H (x,y) — H (x) 


If max’ H,(y) is substituted for max H,(y) 
in Formula 1, that formula becomes mathe- 
matically equivalent with Formula 2, the 
measure of subjective organization as used in 
this paper. 


to being somewhat more appropriate 
to the free recall learning situation. 
This formula is as follows: 


> ni; log nij—min >> nj; log nj; 
tJ t.J 


> na; log n;—min >> nj; log nj; 

| oe 
the numerical 
the ith row 


SO 


where 1;; 
value of 


represents 
the cell in 


and jth column, »; represents the 

marginal total of the ith row, and 

min >> ,; log n;; represents the mini- 
7 


mum value that >> -”;; log ni; can 
t.J 
assume. 

In cases where no marginal total 
n; exceeds the length of the list L, 
min >> ;; log n;; is equal to 0, and 

i, j 
Formula 2 reduces to the following 


relatively simple expression: 
> nj; log nj; 
7 


SO = 
> 2; log n; 
: 


[3] 


In this formula }> n; log n; can be 
' 
regarded as a measure of maximum 
organization, >- n;; log n;; represents 
J 

the actual organization, and SO is 
simply a measure of actual organiza- 
tion relative to the maximum. Thus 
SO can assume all values between zero 
and unity, the former expressing the 
complete absence of second-order se- 
quential organization, as in the case 
of the ordered stimulus lists presented 
to the subject, and the latter the 
maximum degree of such organiza- 
tion, as in the case of an imaginary 
subject who recalls all words on all 
trials exactly in the same order. SO 
for the data in Table 1, incidentally, 
is .397. 

So far we have considered only 
second-order sequential organization, 
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based on pairs of successive responses. 
The method can be easily extended 
to higher-order dependencies, even 
though the computational labors in- 
volved would become quite prohibi- 
tive. 
order organization can 


However, estimates of higher- 
be obtained 
more easily. For instance, instead of 
setting up a three dimensional matrix 
for calculating third-order SO, we 
can tabulate data in a two dimensional 
matrix with mth items as rows and 
(n + 2)th items as columns. In this 
case we also need two rows for blank 
spaces preceding the first word (xo 
and xo_1), and two columns for blank 
spaces following the last word in each 
recall list (yo and yo,1). This proce- 
dure amounts to collapsing the three 
dimensional matrix along the (m + 1)th 
dimension. That is, we are comput- 
ing an SO measure based on pairs of 
responses separated by one response. 
Such an SO is labeled SO (Lag 1), 
and it does provide an estimate of the 
third-order SO. In a similar manner, 
still higher order dependencies can 
be evaluated. SO (Lag 2) is an esti- 
mate of the fourth-order sequential 
organization, SO (Lag 3) is an esti- 
mate of the fifth-order SO, and so 
forth. 

SO can be calculated for any block 
of trials, usually trials. 
Thus it is sometimes useful to indicate 


successive 


the size of the block on which the 
measure is based. For instance, SO 
(Blks = 3) means SO calculated from 
blocks of three successive trials. SO 
for a single trial is not meaningful, 
as subjective organization is defined 
in terms of the subject’s tendency 
to recall words in the same order on 
fact SO for a 


single trial, when calculated according 


successive trials; in 


to the formulas given, turns out to be 
0, provided that the subject has not 


repeated any pairs of words. 


ENDEL TULVING 


DEMONSTRATION EXPERIMENT: 
SUBJECTIVE ORGANIZATION AND 
PERFORMANCE 


The following simple experiment 
will serve as an illustration of how the 
method works, and the data from the 
experiment will provide tentative 
answers to three questions which can 
be raised in the context of the dis- 
cussion of the concept of organization 
in the first The questions 
are as follows: First, do the subjects 
actually organize ‘‘unrelated’’ words 
when instructions are simply to recall 
as many words as they can? Second, 
does this subjective organization , if 


section. 


found in the data, increase with repe- 
tition of the material? Third, is there 
a systematic relation between sub- 
jective organization and more tradi- 
tional measures of learning, such as 
the number of words correctly re- 
called ? 


Procedure 


\ list of 16 English words was adopted from 
another verbal learning experiment (List III 
in Tulving & Thornton, 1959). All words 
were disyllabic nouns, consisting of five, six, 
or seven letters. The words, here listed 
alphabetically, were as follows: ACCENT, BAR- 
RACK, DRUMLIN, FINDING, GARDEN, HOYDEN, 
ISSUE, JUNGLE, LAGOON, MAXIM, OFFICE, 
POMADE, QUILLET, TREASON, VALLEY, WALKER 
Sixteen different sequences of these words 
were constructed such that, considering any 
block or all 16 orders of the stimulus list, 
there was no second-order or higher-order 
redundancy in the lists. 

Sixteen female undergradute students en- 
rolled in the introductory psychology courses 
at the University of Toronto served as sub- 
jects. Their median age was 19 years. None 
of the subjects had previously participated in 
any verbal learning experiments. 

Each subject was tested individually. In 
the instructions the subject was told that her 
task was to learn a list of 16 words, two-syl- 
lable English nouns, that would be presented 
to her on 16 separate trials. At the end of each 
trial she was to write down as many words 
from the list as she remembered. Then she 
would be shown the same words again, in a 
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different order, and at the end of the trial she 
would again write down all the words that she 
was able to recall. The subject was also told 
that the order in which she recalled the words 
did not matter, her task was simply to recall 
as many of the words on each trial as she 
could. 

The 16 different orders of the stimulus list 
were typed on white paper in lower case letters 
and presented to the subject on a memory 
The rate of exposure was one word 
At the end of the trial the sub- 
ject was given 90 seconds to record her recall 
on sheets of paper lined with 16 consecutively 
numbered lines. 

Che order in which the 16 ordered lists were 


drum, 


per sec ond. 


presented to the subjects was systematically 
subjects. The 
method used for assigning the 16 orders of the 
stimulus list to the 16 subjects on the 16 trials 


counterbalanced among 


was the same as that used in assigning the 16 
words to 16 serial positions in 16 orders of the 
stimulus list. 


Results 


Two kinds of data are of interest in 
this experiment, performance (P) 
defined in terms of frequency of cor- 
rect recall, and subjective organiza- 
tion (SO). Extralist intrusions and 
misspelled words from the list were 
ignored in computing both P and SO 
§ ~6Repetitions of list words 
within a given trial were included in 
the recall matrix and thus entered 
the SO score, but not the P score. 
The mean number of extralist intru- 


scores.’ 


sions, together with misspelled words, 
was 0.13 per trial, the mean number 
of repetitions of list words was 0.09 
per trial. 

Second-order SO scores were calcu- 


lated for successive blocks of two 
trials (SO, Blks 2), for running 
blocks of three trials (SO, Blks 3) 
and for the total block of 


’ 


16 trials 


> Such extralist intrusions are of interest in 
their own right and may provide valuable 
information about the organizing 
In order to handle them in the present situ- 
ation, the recall matrix can be simply extended. 
However, such intrusions can probably be 
better studied in different experimental situ- 
ations (e.g., Deese, 1959). 


process. 
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Fic. 1. Mean performance (upper curve) 
and mean SO (lower curve) as a function of 
trials. (Values of performance are to be read 
from the left ordinate, SO from the right 
Mean SO for blocks of two trials 
from 16 statistical subjects are shown for 
comparison, open triangles. For further 
explanation see the text.) 


ordinate. 


(SO, Blks = 16). SO (Blks = 2) 
scores were also computed for 16 
statistical subjects.‘ 

The effect of repetition on both P 
and SO is summarized graphically in 
Figure 1. The values of P are to be 
read from the left, the values of SO 
from the right hand ordinate. The 
abscissa represents trials. The upper 
curve is the typical learning curve, 
showing performance as a function of 
trials. The lower curve, fitted by 
inspection to data points representing 
SO (Blks = 3), could be called an 


*1BM Model 650 Electronic Data Process- 
ing Machine was used in these calculations. 
Thanks are due to Albert S. Bregman for 
constructing the program. Data from statis- 
tical subjects, whose ‘“performance’’ was 
matched with that of individual experimental 
subjects, were collected as an attempt to 
estimate the amount of organization occurr- 
ing by chance, since the sampling distribution 
of SO, under the null hypothesis of no or- 
ganization, is not known. ‘ 
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“organization” curve. The SO for 
each block is plotted against the 
middle trial of the block. Thus there 
are 14 SO (Blks = 3) scores. SO 
scores for successive blocks of size 
two are also shown for both the real 
and the statistical subjects, plotted 
against the midpoints of the trial 
blocks. 

These data clearly demonstrate 
that subjects do in fact organize their 
recall sequentially even in the absence 
of such sequential organization in 
stimulus lists, and that this organiza- 
tion increases systematically with 
repeated exposures to, and recall of 
the material. Thus, in an experi- 
mental situation such as the present 
one, repetition has two parallel effects: 
increasing frequency of recall is ac- 
companied by an increasingly tighter 
sequential organization. 

The slopes of the two curves de- 
picted in Figure 1 seem to be different. 
When mean FP scores are plotted 
against mean SO (Blks = 3) for the 
14 trials, the curvilinearity of the rela- 
tion is obvious. However, the rela- 
tion becomes quite linear when a 
logarithmic transformation is applied 
to the mean SO scores, suggesting 
that, as a first approximation, increase 
in performance is proportional to in- 
crease in log SO. The product-mo- 
ment correlation between mean P 
and log mean SO (Blks = 3) for 
Trials 2 to 15 was found to be +.96. 

Estimates of higher-order SO were 


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF SO 
(Blks = 16) FoR VARIOUS LAGs 


| | 
Group Lag 0) Lag 1| Lag 2! Lag 3 


Experimental subjects 


Mean | .292 224 .223 | .216 


| 
} 
Standard deviation | .054 | .045 | .034 | .035 
Statistical subjects | 
Mean .180 | .169 
Standard deviation 017 } } O12 


Note.—N 


= 16 for each group 
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TABLE 3 
INTERCORRELATIONS AMONG \SO (Blks=16) 
MEASURES FOR LaAGs 0, 1, 2, AND 3; AND 
CORRELATIONS BETWEEN THESE SO 
SCORES AND MEAN P ScoREsS ON 
TRIALS 1 To 8, 9, To 16, AND 
1 To 16 FoR INDIVIDUAL 
SUBJECTS 


Measures 
Lag 0 | Lag 1 

SO (Lag 1) + 86 

SO (Lag 2) +.62 

SO (Lag 3) t.58 

P (Trials ito8) | 45 | 

P (Trials 9 to 16) 78 | 

P (Trials 1 to 16) +.63 | 


Note N = 16 


obtained in the form of SO scores for 
Lags 1, 2, and 3, for the total block of 
16 trials. The mean SO (Blks 16) 
scores for these lags and their standard 
deviations are shown in Table 2, 
together with comparable data from 
the statistical subjects for Lags 0 and 
3. Even though mean SO 
seem to decrease with increasing lags, 
it is interesting to note that SO (Lag 3, 
Blks = 16) scores are higher than the 
same scores from the statistical sub- 
jects. The median test yielded a 
chi-square of 15.12, which is highly 
significant. 

The intercorrelations 
(Blks = 16) scores for Lags 0, 1, 2, 
and 3, as well as correlations between 
these SO scores and several average 
P measures are shown in Table 3. 
These correlations are based on indi- 
vidual data from 16 subjects. 

Certain orderly relations appear in 
Table 3. First, the correlations be- 
tween any two adjacent SO measures 
on the lag dimension are reasonably 
high, all three listed in the table being 
approximately +.85. Second, with 
increasing distance on the lag dimen- 
sion between any two SO measures 
Thus, for 


scores 


among SO 


the correlations decrease. 
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example, SO (Lag 0) has a correlation 
of +.85 with that for Lag 1, +.62 
with Lag 2, and +.58 with Lag 3. 
Third, correlations between P and 
SO decrease with increase in the lag of 
the SO measure. The most import- 
ant conclusion to be drawn from these 
data is that, within the limits of the 
present method, the second-order SO 
is about as useful a measure of se- 
quential organization as that based 
on any combination of different 
orders, and more useful than any 
other single higher-order SO measure. 

Positive correlations between SO 
and P scores in Table 3, where correla- 
tion is done on data from 16 subjects, 
support the observation of the same 
relation over trials. It is also inter- 
esting to note that on the basis of a 
single estimate of organization for 
each subject, that based on the total 
block of 16 trials, SO accounts for a 
larger amount of variance in the P 
variable for Trials 9 to 16 than for 
Trials 1 to 8. 

All the above findings relate to 
organization as an intrasubject phe- 
nomenon. Commonality of organ- 
ization between subjects, however, can 
also be investigated by using the same 
general method. That there might 
be such commonality became appar- 
ent in the inspection of recall data 
from different subjects. Recall ma- 
trices from individual subjects showed 
that quite often the patterning of re- 
sponse sequences was similar for many 
subjects. This phenomenon was in- 
vestigated in two ways. 

First,. the recall data from all 16 
subjects were pooled for all 16 trials 
and entered into an “‘intersubject re- 
call matrix.”” If all subjects organized 
their recall differently, the distribu- 
tion of cell entries in the group recall 
matrix would not be significantly 
different from the distribution under 
conditions where only chance factors 


IN FrEE RECALL 





MEAN INTERSUBJECT SO 











2 ae os @T se 
BLOCKS OF TRIALS 
Fic. 2. Mean intersubject SO scores” for 


16 subjects on eight successive blocks of two 
trials. 


are operating. To test the signifi- 
cance of the apparent deviation of the 
data in this matrix from chance dis- 
tribution, the information function 
T (x,y) was computed. It was found 
to be 0.352 bits which is significant at 
better than .01 level, using the method 
suggested by Miller (1955). This 
finding thus confirms the casual ob- 
servation that there is a certain degree 
of commonality in the subject's recall. 


The second method used to explore 
intersubject organization was as fol- 


lows. For a single trial data were 
pooled in a recall matrix from all 16 
subjects. For each such intersubject 
recall matrix SO was computed. This 
procedure results in 16 intersubject 
SO measures, one for each trial. 
These data are shown in Figure 2. 
Intersubject SO measures have been 
averaged for blocks of two trials, in 
order to smooth the curve. In view 
of the orderly increase in the intersub- 
ject SO measure with repetition, it 
would seem safe to conclude that the 
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commonality of organized recall se- 
quences between subjects increases 


with repetition of the material. 
Disc USSION 


All three questions posed at the 
beginning of the experiment have been 
answered in the affirmative. It seems 
that the subjects do impose a se- 
quential structure on their recall, that 
this subjective organization increases 
with repeated exposures and recall 
of the material, and that there is a 
positive correlation between organ- 
ization and performance. None of 
these findings is very surprising, and 
all are in good agreement with related 
experimental Bousfield 
and his associates, as well as other 
amply demon- 


evidence. 


investigators, have 
strated that there is a strong tendency 
in subjects to recall randomly pre- 
sented material in sequences of re- 
lated words or clusters (Bousfield, 
1953; Cohen & Bousfield, 1956; Jen- 
kins & 1952). It 
been shown that repeated presenta- 
tions of the stimulus word list in- 
creases this clustering tendency (Bous- 
field & Cohen, 1953), and that there 
is a direct relation between the degree 
of clustering and the amount of recall 
(Bousfield & Cohen, 1955; Bousfield, 
Cohen, & Whitmarsh, 1958; Jenkins, 
Mink, & Russell, 1958; Sakoda, 1956). 
The findings of the present experiment 
extend the domain of these phenomena 
of organization from one trial recall 
and experimentally ma- 
terials, as used in all the above ex- 
learning situation, 
successive 


Russell, has also 


organized 


periments, to a 
with several 
experimentally unorganized materials. 

The finding that the organizing 
effects, well demonstrated in experi- 
ments on clustering, can be experi- 
mentally assessed even in case of un- 
related words, is quite encouraging. 
Perhaps paradoxically this suggests 
that a list of completely unrelated 


trials, and 


TULVING 


words is probably as fictional as is a 
truly nonsensical nonsense syllable. 


It is for this reason that reference has 
been made throughout this paper to 
‘“‘unrelated’’ words, referring only to 
the fact that words are experimentally 
unselected as to their meaning. 

The present method of quantifying 
sequential organization in free recall 


learning constitutes only a_ rather 
coarse net for capturing the process of 
organization in all its diversity. For 
one thing, the SO measure is most 
sensitive to rigid sequences of re- 
sponses, or chunks (Miller, 1956c). 
It is quite likely, however, that a great 
deal of organization occurs in the form 
of clusters (Bousfield, 1953), in which 
have no fixed order. This 
would tend not only to attenuate 
measures of SO, but also, through 
increased variability of SO, depress 
correlations with the performance 
variable. 

It would be interesting, in view of 
Miller’s unitization hypothesis, to 
look at organization in terms of its 
In many cases subjects un- 


items 


units. 
doubtedly impose a hierarchical or- 
ganization on recall, or use a Plan 
(Miller, Galanter, & Pribram, 1960). 
In other cases, particularly when the 
list to be learned is short and some 
coding device such as the alphabet 
can be employed, nonhierarchical or- 
ganization may also occur. ‘The pres- 
ent method does not lend itself to 
an identification of various strategies 
adopted by subjects, even though 
casual inspection suggests many di- 
verse sources of organization: associ- 
ative grouping, conceptual categories, 
assonance, grouping in terms of famil- 
iarity of items, and so forth. 

The method provides only an over- 
all measure of organization. This 
measure is related to the size of the 
organizing units in a systematic man- 
ner, but the relation cannot be easily 


specified. The two extremes of the 
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function, however, are determined by 
the definition of the SO measure, and 
the transition from one to the other 
must be gradual. In one case the 
units of organization are individual 
items in the list, resulting from previ- 
ous response integration (Mandler, 
1954) outside the laboratory. In this 
case the probability of two or more 
items being recalled consistently to- 
gether is no greater than what might 
be expected by chance, and the nu- 
merical value of SO is approximately 
the same as that of statistical sub- 
jects. On the other extreme, the 
whole set of items is organized into 
one unit, and SO is at the maximum. 

Assuming a systematic relation be- 
tween the size of the organizing units 
and the numerical value of SO, the 
finding that the second-order SO ac- 
counts for a larger proportion of vari- 
ance in individual P scores than do 
estimates of higher-order measures 
makes good sense. Some 
organization contain only two items, 
and, given a flexible order of units in 
recall, these are not tapped by the 
third-order and higher-order measures 
of SO, while the second-order measure 


units of 


is sensitive to all units of two or more 
interpretation is also 
consistent with the observed inter- 
correlations among SO measures for 
different lags. Although these meas- 
ures are all based on identical se- 
quences of responses, the correlations 
are less than perfect. This probably 
reflects the fact that distributions of 
size of organizing units vary from sub- 
ject to subject. 

What is the significance of the 
method and the preliminary findings 
for the question originally posed about 
the effects of repetition on perform- 
ance? It seems that response vari- 
ables such as SO, based on ordered 
classifications of data, might be quite 
useful in our attempts to shed more 
light on the nature of the acquisition 


items. This 
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process in verbal learning. And the 
present findings seem to have con- 
tributed to the attractiveness of 
Miller’s unitization hypothesis (1956a) 
as a serious beginning of a_ useful 
theory of free recall verbal learning. 
Even though intuitively there seems 
little doubt that performance de- 
pends on organization, the correla- 
tional design used in the present study 
does not yet permit such a conclusion, 
and other kinds of experiments are 
necessary. Given a method of quan- 
tifying behavioral manifestations of 
the organizing process, it is quite 
possible to test many implications of 
the unitization hypothesis. 

It is to be noted that the concept of 
repetition has been specified here 
rather loosely. There are at least 
two operationally distinguishable 
phases which should be studied sep- 
arately, presentation of material and 
test for recall. A conceptual and ex- 
perimental clarification of this im- 
portant concept is clearly needed. 

The observation of common recall 
sequences among different subjects 
and of increased stereotypy of such 
intersubject organization under con- 
ditions of practice very strongly sug- 
gests, or rather confirms the expecta- 
tion, that sources of organization are 
discovered by subjects in the material, 
rather than invented idiosyncrati- 
cally. Many task variables, therefore, 
are expected to influence both subjec- 
tive organization and recall of the 
material, and a systematic exploration 
of these relations constitutes an im- 
portant part of the research program 
designed to evaluate the tenability of 
the unitization hypothesis. 


SUMMARY 


A method for examining and quanti- 
fying sequential dependencies in the 
subjects’ free recall of words on suc- 
cessive recall trials has been presented. 
Subjective organization as a depend- 
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ent variable was defined in terms of 
the subject’s tendency to recall items 
in the same order on different trials in 
the absence of any experimentally 
manipulated sequential organization 
among items in the stimulus list. In 
a preliminary experiment it was found 
that the subject’s recall behavior 
manifests such subjective organiza- 
tion, that this organization increases 
with repetition, and that there is a 
positive correlation between organ- 
ization and performance. These data 


were discussed with reference to the 
problem of the role of repetition in 
free recall verbal learning. 
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The growing interest in autoin- 
structional devices and their component 
learning programs has had the effect 
of focusing attention on what may be 
called “productive learning.” By this 
phrase is meant the kind of change in 
human behavior which permits the in- 
dividual to perform successfully on an 
entire class of specific tasks, rather 
than simply on one member of the 
class. Self-instructional programs are 
designed to ensure the acquisition of 
capabilities of performing classes of 
tasks implied by names like “binary 
numbers,” “musical notation,” “solving 
linear equations” rather than tasks re- 
quiring the reproduction of particular 
responses. 

When viewed in this manner, learn- 
ing programming is not seen simply as 
a technological development incorpo- 
rating previously established learning 
principles, but rather as one particular 
form of the ordering of stimulus and 
response events designed to bring 
about productive learning. It should 
be possible to study such learning, and 
the conditions which affect it, by the 
use of any of a variety of teaching ma- 
chines, although there are few studies 
of this sort in the current literature 
(cf. Lumsdaine & Glaser, 1960). In 
the laboratory, the usual form taken by 
studies of productive learning has been 
primarily that of the effects of instruc- 
tions and pretraining on problem solv- 
ing (e.g., Hilgard, Irvine, & Whipple, 


1 This study was made possible in part by 
funds granted by the Carnegie Corporation 
of New York. The opinions expressed are 
those of the author, and do not necessarily 
reflect the views of that Corporation. 


1953; Katona, 1940; Maltzman, Eis- 
man, Brooks, & Smith, 1956). 

When an individual is subjected to 
the situation represented by a learning 
program, his performance may change, 
and the experimenter then infers that 
he has acquired a new capability. It 
would not be adequate to say merely 
that he has acquired new “responses,” 
since one cannot identify the specific 
responses involved. (Adding frac- 
tions, for example, could be repre- 
sented by any of an infinite number of 
distinguishable stimulus situations, and 
an equal number of responses.) Since 
we need to have a term by means of 
which to refer to what is acquired as 
a result of responding correctly to a 
learning program, we may as well use 
the term “knowledge.” By definition, 
“knowledge” is that inferred capability 
which makes possible the successful 
performance of a class of tasks that 
could not be performed before the 
learning was undertaken. 


Some Initial Observations 


In a previous study of programmed 
learning (Gagné & Brown, 1961) sev- 
eral kinds of learning programs were 
used in the attempt to establish the 
performance, in high school boys, of 
deriving formulas for the sum of n 
terms in a number series. Addi- 
tional observations with this material 
led us to the following formulation: 
In productive learning, we are dealing 
with two major categories of variables. 
The first of these is knowledge, that 
is, the capabilities the individual pos- 
sesses at any given stage in the learn- 
ing; while the second is instructions, 


>) 
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the content of the communications pre 
sented within the frames of a learning 
program. 

In considering further the knowl- 
edge category, it has been found pos- 
to identify this class of variable 
more comprehensively in the following 
way: Beginning with the final task, the 
question is asked, What kind of capa- 
bility would an individual have to 
possess if he were able to perform this 
task successfully, were we to 
him only instructions? The 
to this question, it turns out, identifies 
a new class of task which appears to 
have several important characteristics. 
Although it is conceived as an internal 
“disposition,” it is directly measurable 
Yet it is not the 


sible 


give 


answer 


as a performance. 


same performance as the final task 


from which it was derived. It is in 
some sense simpler, and it is also more 
general. In other words, it appears 
that what we have defined by this pro- 
cedure is an entity of “subordinate 
knowledge” which is essential to the 
performance of the more specific final 
task. 

Having done this, it was natural to 
think next of repeating the procedure 
with this newly defined entity (task). 
What would the individual have to 
know in order to be capable of doing 
this task without undertaking any 
learning, but given only some instruc- 
tions? This time it seemed evident 
that there were two entities of sub- 
ordinate knowledge which combined in 
support of the task. Continuing to 
follow this procedure, we found that 
what we were defining was a hierarchy 
of subordinate knowledges, growing in- 
creasingly “simple,” and at the same 
time increasingly general as the defin- 
ing process continued. 

By means of this systematic analysis, 
it was possible to identify nine sepa- 
rate entities of subordinate knowledge, 
arranged in hierarchical fashion (see 


Figure 1 Generally stated, our 
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hypothesis was that (a) no individual 
could perform the final task without 
having these subordinate capabilities 
(i.e., without being able to per- 
form these simpler and more general 
tasks) ; and (/) that any superordinate 
task in the hierarchy could be per- 
formed by an individual provided 
suitable instructions were given, and 
provided the relevant subordinate 
knowledges could be recalled by him. 

It may be noted that there are some 
possible resemblances between the enti- 
ties of such a knowledge hierarchy and 
the hypothetical constructs described 
by three other writers. First are the 
habit-family hierarchies of Maltzman 
(1955), which are conceived to medi- 
ate problem solving, and are aroused 
by instructions (Maltzman et al., 
1956). The second are the “organiza- 
tions” proposed by Katona (1940), 
which are considered to be combined 
by the learner into new knowledge 
after receiving certain kinds of instruc- 
tions, without repetitive practice. The 
third is Harlow’s (1949) concept of 
learning set. Harlow’s monkeys ac- 
quired a general capability of success- 
fully performing a class of tasks, such 
as oddity problems, and accordingly 
are said to have acquired a learning 
set. There is also the suggestion in 
one of Harlow’s (Harlow & Harlow, 
1949) reports that there 
hierarchical arrangement of tasks more 
complex than oddity problems which 
monkeys can _ successfully perform. 
Since we think it important to imply 
a continuity between the relatively 
complex performances described here 
and the simpler ones performed by 
monkeys, we are inclined to refer to 
these subordinate capabilities as “learn- 
ing sets.” 


may be a 


REQUIREMENTS OF THEORY 
If there is to be a theory of produc- 
tive learning, it evidently must deal 
with the independent variables that can 
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be identified in the two major cate- 
gories of instructions and subordinate 
capabilities, as well as with their inter- 
actions, in bringing about changes in 
human performance. 


Instructions 

Within a learning program, instruc- 
tions generally take the form of sen- 
tences which communicate something 
to the learner. It seems possible to 
think of such “communication” as 
being carried out with animals lower 
than man, by means of quite a dif- 
ferent set of experimental operations. 
Because of these communications, the 
human learner progresses from a point 
in the learning sequence at which he 
can perform one set of tasks to a point 
at which he achieves, for the first time, 
a higher level learning set (class of 
tasks). What functions must a theory 
of knowledge acquisition account for, 
if it is to encompass the effects of in- 
The following paragraphs 
these func- 


structions ? 
will attempt to 
tions, not necessarily in order of im- 


describe 


portance. 

First, instructions make it possible 
for the learner to identify the required 
terminal performance (for any given 
learning set). In educational terms, it 
might be said that they “define the 
goal.” For example, if the task is 
adding fractions, it may be necessary 
for the learner to identify 15%4 as an 


adequate answer, and ®%4 as an inade- 


quate one, 

Second, instructions bring about 
proper identifications of the elements 
of the stimulus situation. For ex- 
ample, suppose that problems are to 
be presented using the word “fraction.” 
The learner must be able to identify 75 
as a fraction and .4 as not a fraction. 
Or, he may have to identify = as “sum 
of,” and » as “number.” Usually, in- 
structions establish such identifications 
in a very few repetitions, and some- 
times in a single trial. If there are 
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many of them, differentiation may re- 
quire several repetitions involving con- 
trasting feedback for right and wrong 
responses. 

A third function of instructions is 
to establish high recallability of learn- 
ing sets. The most obviously manipu- 
lable way to do this is by repetition. 
However it should be noted that repeti- 
tion has a particular meaning in this 
context. It is not exact repetition of 
a stimulus situation (as in reproductive 
learning), but rather the presentation 
of additional examples of a class of 
tasks. Typically, within a learning 
program, a task representing a particu- 
lar learning set is achieved once, for 
the first time. This may then be fol- 
lowed by instructions which present 
one or more additional examples of this 
same class of task. “Variety” in such 
repetition (meaning variety in the 
stimulus context) may be an important 
subvariable in affecting recallability. 
Instructions having the function of es- 
tablishing high recallability for learn- 
ing sets may demand “recall,”’ as in the 
instances cited, or they may on other 
occasions attempt to achieve this effect 
by “recognition” (i.e., not requiring 
the learner to produce an answer). 

The fourth function of instructions 
is perhaps the most interesting from 
the standpoint of the questions it 
raises for research. This is the 
“guidance of thinking,” concerning 
whose operation there is only a small 
amount of evidence (cf. Duncan, 1959). 
Once the subordinate learning sets 
have been recalled, instructions are 
used to promote their application to 
(or perhaps “integration into”) the 
performance of a task that is entirely 
new so far as the learner is concerned. 
At a minimum, this function of instruc- 
tions may be provided by a statement 
like “Now put these ideas together to 
solve this problem;” possibly this 
amounts to an attempt to establish a 
set. Beyond this, thinking may be 
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guided by suggestions which progres- 
sively limit the range of hypotheses 
entertained by the learner, in such a 
way as to decrease the number of in- 
correct solutions he considers (cf. 
Gagné & Brown, 1961; Katona, 1940). 
Within a typical learning program, 
guidance of thinking is employed after 
identification of terminal performance 
and of stimulus elements have been 
completed, and after high recallability 
of relevant learning sets has been en- 
sured. In common sense terms, the 
purpose of these instructions is to sug- 
gest to the learner “how to approach 
the solution of a new task” without, 
however, “telling him the answer.” 

Obviously, much more is needed to 
be known about the effects of this vari- 
able, if indeed it is a single variable. 
Initially, it might be noted that guid- 
ance of thinking can vary in amount; 
that is, one can design a set of instruc- 
tions which say no more than “now do 
this new task” (a minimal amount) ; 
or, at the other end of the scale, a set 
of instructions which in effect suggest 
a step-by-step procedure for using pre- 
viously acquired learning sets in a new 
situation. 


Subordinate Capabilities: Learning 
Sets 

When one begins with the perform- 
ance of a particular class of tasks as 
a criterion of terminal behavior, it is 
possible to identify the subordinate 
learning sets required by means of the 
procedure previously described. The 
question may be stated more exactly 
as, “What would the individual have 
to be able to do in order that he can 
attain successful performance on this 
task, provided he is given only instruc- 
tions?” This question is then applied 


successively to the subordinate classes 
identified by the answer. 
“What he would have to be able to do” 
is in each case one or more perform- 


of tasks 
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ances which constitute the denotative 
definitions of learning sets for particu- 
lar classes of tasks, and totally for the 
entire knowledge hierarchy. 

A theory of knowledge acquisition 
must propose some manner of func- 
tioning for the learning sets in a 
hierarchy. A good possibility seems 
to be that they are mediators of posi- 
tive transfer from lower-level learning 
sets to higher-level tasks. The hy- 
pothesis is proposed that specific trans- 
fer from one learning set to another 
standing above it in the hierarchy will 
be zero if the lower one cannot be 
recalled, and will range up to 100% if 
it can be. 

In narrative form, the action of the 
two classes of variables in the acquisi- 
tion of knowledge is conceived in the 
following way. A human learner be- 
gins the acquisition of the capability of 
performing a particular class of tasks 
with an individual array of relevant 
learning sets, previously acquired. He 
then acquires new learning sets at pro- 
gressively higher levels of the knowl- 
edge hierarchy until the final class of 
tasks is achieved. Attaining each new 
learning set depends upon a process of 
positive transfer, which is dependent 
upon (a) the recall of relevant sub- 
ordinate learning sets, and upon (b) 
the effects of instructions. 


EXPERIMENTAL PREDICTIONS 
AND RESULTS ? 


Using the procedure described, we 
derived the knowledge hierarchy de- 
picted in Figure 1 for the final task of 
“deriving formulas for the sum of n 
terms in number series.” 

As mentioned previously, it con- 
tained nine hypothesized learning sets. 
(The final row of circled entities will 


2The author is grateful to Bert Zippel, 
Jr., for assistance in the preparation of learn- 
ing program materials and in the collection 
of a portion of the data. 
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Hierarchy of knowledge for the task of finding formulas 


for the 


for the sum of nm terms in a number series. 


be discussed later.) Each of these 
subordinate knowledges can be repre- 
sented of task to be per- 
formed. 


as a class 


Measuring Initial Patterns of Learning 
Sets 

It is predicted that the presence of 
different patterns of learning sets can 
be determined for individuals who are 
unable to perform a final task such as 
the one under consideration. To test 
this, we administered a series of test 
items to a number of ninth-grade 
boys. These items were presented on 
4” x 6” cards, and the answers were 
written on specially prepared answer 


sheets. This particular method was 
used in order to make testing continu- 
ous with the administration of a learn- 
ing program to be described hereafter. 
Each test item was carefully prepared 
to include instructions having the 
function of identification of terminal 
performance and of elements of the 
stimulus situation. 

3eginning with the final task, the 
items were arranged to be presented in 
the order I, ITA, IVA, IVA2, IVAB, 
IIB, IIIB, 1VB1, and IVB2. For any 
given subject, the sequence of testing 
temporarily stopped at the level at 
which successful performance was first 
reached, and a learning program de- 
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signed to foster achievement at the 
next higher level (previously failed) 
was administered. This program and 
its results will shortly be described. 
Following this, testing on the remain- 
ing learning set tasks was undertaken 
in the order given. The possibility of 
effects of the learning program on the 
performance of these lower-level learn- 
ing sets (not specifically practiced in 
the learning) is of course recognized, 
but not further considered in the pres- 
ent discussion. 

A particular time limit was set for 
each test item, at the expiration of 
which the item was scored as failed. If 
a wrong answer was given before this 
time limit, the subject was told it was 
wrong, and encouraged to try again; if 
the correct answer was supplied within 
the time limit, the item was scored as 
passed. It is emphasized that these 
time limits, which were based on pre- 
liminary observations on other subjects 
with these tasks, were not designed to 
put “time pressure” on the subjects, 
nor did they appear to do so. 

The patterns of success achieved on 
the final task and all subordinate learn- 
ing set tasks, by all seven subjects, are 
shown in Table 1. The subjects have 
been arranged in accordance with their 
degree of success with all tasks, begin- 


Ropsert M. GAGneE 


ing with one who failed the final task 
but succeeded at all the rest. Several 
things are apparent from these data. 
First of all, it is quite evident that 
there are quite different “patterns of 
capability” with which individuals ap- 
proach the task set by the study. Some 
are unable to do a task like IIA (see 
Figure 1), others to do a task like IIB 
which is of course quite different. Still 
others are unable to do either of these, 
and in fact cannot perform successfully 
a task like IIIB. All seven of these 
subjects were able to perform IV-level 
tasks successfully, although in prelimi- 
nary observations on similar tasks 
we found some ninth-grade boys who 
could not. 

Second, the patterns of pass and fail 
on these tasks have the relationships 
predicted by the previous discussion. 
There are no instances, for example, 
of an individual who is able to perform 
what has been identified as a “higher- 
level” learning set, and who then shows 
himself to be unable to perform a 
“lower-level” learning set related to it. 

If learning sets are indeed essential 
for positive transfer, the following con- 
sequences should ensue: 

1. If a higher-level learning set is 
passed (+), all related lower-level 
tasks must have been passed (+). 


TABLE 1 


PATTERN OF SUCCESS ON LEARNING Set Tasks RELATED TO THE FINAL NUMBER 
SERIES TASK FOR SEVEN NINTH GRADE Boys 


Subject 


IIIB 


ww 
wc 
PM 
GR 
DJ 
JR 
RH 


Note.— 


\ 
a 


+ 
+ 


Task 


IVBI | IVB2 


IvA1 | IVA2 


++++4+4++4+] 





| H+ +++++ | 
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TABLE 2 


Pass-FAtL RELATIONSHIP BETWEEN RELATED ADJACENT HIGHER- AND LOWER-LEVEI 
LEARNING SETS FOR A GROUP OF SEVEN NINTH-GRADE Boys 


Number of cases with relationship 


Relationship exami 
Higher 
Lower 


Higher + 


Lower 
Final Task: I 0 6 
1: ILA, IIB 5 
IIA: IVA1, IVA2, IVAB 0 


IIB: IIIB 2 
I11B: IVAB, IVB1, IVB2 ; 0 


Fail 


2. If one or more lower-level tasks 
been failed (—), the related 
higher-level tasks must be failed (-—). 


have 
3. If a higher-level task is passed 
(+), no related lower-level tasks must 
have been failed (—). 

4. If a higher-level task has been 
failed (—), related lower-level tasks 
may have been passed (+). The ab- 
sence of positive transfer in this case 
would be attributable to a deficiency in 
instructions, and does not contradict 
the notion that lower-level sets are es- 
sential to the achievement of higher- 
level ones. 

The relationships found to exist in 
these seven subjects are summarized in 
Table 2, where each possible higher 
lower-level task relationship possible of 
testing is listed in the left-hand column. 
It will be noted that there are several 
relationships of the type higher (—), 
Column 5. 


lower (+), as listed in 


These provided no test of the hypothe- 
relations 
The instances in 


sis regarding hierarchical 
among learning sets. 
the remaining columns do, however. 
The ++ and —— instances are verify- 
+— instances would be 
nonverifying. As the final 
indicates, the percentage of verifying 
instances is in all cases 100%. 


ing, whereas 
column 


Test of relationships 


N Proportion 
(1 +2 +3) (1+2)/(1 +2 +3) 
ef 1.00 
Sie 1.00 

1.00 

1.00 

1.00 


Higher — 
Lower + 


Higher + 
Lower — 


NnNUe 


Effects of Learning Program Adminis- 
tration 


If the characteristics of instructions 
as previously described are correct, it 
should be possible to construct a learn- 
ing program which can be begun for 
each individual at the point of his low- 
est successful learning set achievement, 
and bring him to successful achieve- 
ment of the final class of tasks. Briefly, 
its method should be to include frames 
which have the functions of (@) insur- 
ing high recallability of relevant learn- 
ing sets on which achievement has been 
demonstrated; (b) making possible 
identifications of expected perform- 
ance and of new stimuli, for each newly 
presented task; and (c) guiding think- 
ing so as to suggest proper directions 
for hypotheses associating subordinate 
learning sets with each new one. 

A program of this sort was admin- 
istered to each of the seven ninth-grade 
boys, beginning at the level at which he 
first attained success on learning set 
tasks (Table 1). This was done by 
means of a simple teaching machine 
consisting of a visible card file clipped 
to a board mounted at a 40° angle to 
the learner’s table, and containing 
material typed on 4” X 6” cards. He 
wrote his answer to successive frames 
on a numbered answer sheet, then 
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flipped over the card to see the correct 
answer on the back. He was in- 
structed that if his answer was wrong, 
he should flip the card back, and read 
the frame again until he could “see” 
what the right answer was. 

After completing the instructional 
portion of the program for each learn- 
ing set, the learner was again presented 
with the identical test-item problem he 
had tried previously and failed. If he 
was now able to do it correctly, he was 
given five additional items of the same 
sort to perform, and then taken on by 
instructions to another learning set in 
either a coordinate or higher-level posi- 
tion in the hierarchy. This process 
was continued through the perform- 
ance of the final task. 

The data collected in this way yield 
pass-fail scores on each test item 
(representing a particular member of a 
class of tasks) before the administra- 
tion of the learning program, and simi- 
lar scores on the same item after learn- 
ing. It is recognized that for certain 
experimental purposes, one would wish 
to have a different, matched, task for 
the test given after learning, to control 
for the effects of “acquaintance” during 
the first test. Since this study had an 
exploratory character, such a control 
was not used this time. However, it 
should be clearly understood that the 
first experience with these test items in 
question, for these subjects, involved 
only activity terminating in failure 
to achieve solution. No information 
about the correct solutions was given. 

A striking number of instances of 
success in achieving correct solutions 
to learning set tasks was found follow- 
ing learning as compared with before. 
These results are summarized in 
Table 3. Although for learning set 
IIB the percentage of success was 
only 50% (with two cases), there 
two learning sets for which 
success was achieved, and the 


were 
100% 
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TABLE 3 


NUMBER OF INSTANCES OF PASSING AND 
FAILING FINAL TASK AND SUBORDINATI 
LEARNING Set TASKS BEFORE AND 
AFTER ADMINISTRATION OF AN 
ADAPTIVE LEARNING PRO- 

GRAM, IN A GROUP OF 
SEVEN NINTH- 

GRADE Boys 


| Number | Number 
—" failing of these 
Task | before passing after 
| learning learning 


Percentage 
success 


Final Task | 86 

I | 
Il | 
IIB 
IIIB 
Total 


percentage for all instances combined 
These results provide addi- 
tional evidence compatible with the 
idea of the knowledge hierarchy. 

The learner in such a program does 
not “practice the final task”; he ac- 


was 86%. 


quires specifically identified capabili- 
ties in a specified order. In as many 
as six out of seven cases, we were able 
by this means to bring learners from 
various levels of competence all the 
way to final task achievement. (It is 
perhaps important that the exception 
was JR, one of two who had most to 
learn.) Of course, it must be recog- 
nized that two separable causes con- 
tribute to the effects of the learning 
program in this study: (a) the cor- 
rectness of the learning set analysis; 
and (b) the specific effectiveness of the 
instructions contained in the learning 
program. 


IMPLICATIONS FOR INDIVIDUAI 
DIFFERENCES MEASUREMENT 


It is evident that learning sets, as con- 
ceived in this paper, operate as “indi- 
vidual differences” variables, which, 
when suitably manipulated, also be- 
come “experimental” variables. There 
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are some additional implications which 
need to be pointed out regarding the 
functioning of learning sets in the de- 
termination of measured individual 
differences. 

As the process of identification of 
subordinate learning sets is progres- 
sively continued, one arrives at some 
learning sets which are very simple and 
general, and likely to be widespread 
within the population of learners for 
which the task is designed. Consider, 
for example, learning set IVB (Figure 
1), which is represented by a task such 
as 4 X 2=5+? Ifone makes a further 
analysis to identify a subordinate learn- 
ing set for this task, the answer appears 
to be, “adding, subtracting, multplying, 
and dividing one- and two-place num- 
bers.” It is interesting to note that 
this is exactly the task provided by a 
set of factor reference tests (French, 
1954) called Number. In a similar 
manner, the other two circled entities 
in the last row of Figure 1 were 
identified. One is Symbol Recogni- 
tion (called Associative Memory by 
the factor researchers), and Recogni- 
tion of Patterns (called Flexibility 
of Closure). The implication is, there- 
fore, that these simplest tasks, identi- 
fied by factor analysis techniques as 
common to a great variety of human 
performances, also function as learning 
sets. 

The hypothesis has been proposed 
that learning mediate positive 
transfer to higher-level tasks. Very 
often, if not usually, the measurement 
of transfer of training implies that a 
second task is learned more rapidly 
when preceded by the learning of an 
initial task, than when not so preceded. 
Accordingly, it seems necessary to dis- 
tinguish between expected correlations 
of these basic factors (at the bottom of 
the hierarchy) with rate of attainment 
of higher-level learning sets on the one 
hand, and correlations of these same 


sets 
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factors with achievement of higher- 
level learning sets on the other. 

The implications of this line of rea- 
soning would seem to be somewhat as 
follows: Factors which are found by 
the kind of psychological analysis pre- 
viously described to lie at the bottom of 
the knowledge hierarchy should exhibit 
certain predictable patterns of correla- 
tion with higher-level learning sets. 
They should correlate most highly with 
rate of attainment of the learning sets 
in the next higher level to which they 
are related, and progressively less as 
one progresses upwards in the hier- 
archy. The reason for this is simply 
that the rate of attainment of learning 
sets in a hierarchy comes to depend to 
an increasing extent on the learning 
sets which have just previously been 
acquired and accordingly to a decreas- 
ing extent upon a basic factor or abil- 
ity. Some analogy may be drawn here 
with the findings of Fleishman and 
Hempel (1954) on motor tasks. 

The expected relationships between 
factor test scores and achievement 
scores (passing or failing learning 
sets) throughout such hierarchies seem 
to require a somewhat more complex 
derivation. First of all, such relation- 
ships will depend upon the effective- 
ness of a learning program, or perhaps 
on the effectiveness of previous learn- 
ing. If the learning program is per- 
fectly effective, for example, and if 
differences in rate of attainment are 
ignored, everyone will pass all the 
learning set tasks, and the variance will 
accordingly be reduced to zero. Under 
these circumstances, then, one may 
expect all correlations with basic fac- 
tors to be zero. However, one must 
consider the case in which the learning 
program is not perfectly effective. In 
such a case, the probability that an 
individual will acquire a new learning 
set, as opposed to not acquiring it, will 
presumably be increased to the extent 
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that he scores high on tests of related 
abilities. If continues to 
collect scores on learning set tasks of 
both successful achievers and those 
who fail, the result will presumably be 
an increasing degree of correlation be- 
tween basic ability scores and learning 
set tasks as one progresses upwards in 
the hierarchy. The reason for this is 
that the size of the correlation comes 
to depend more and more upon vari- 
ance contributed by those individuals 
who are successful, and less and less on 
that contributed by those who effec- 
tively “drop out.” 

The difference in expectation be- 
tween the increasing pattern of correla- 
tion with achievement scores, and the 
decreasing pattern with measures of 
rate of attainment, is considered to be 
of rather general importance for the 
area of individual differences measure- 
ment. Confirmatory results have been 
obtained in a recent study (Gagné & 
Paradise, 1961) concerned with the 
class of tasks “solving linear algebraic 
equations.” 


basic one 


DISCUSSION 


The 


productive 
learning implied in this paper is that 
it is a matter of transfer of training 
from component learning sets to a new 
activity which incorporates these pre- 


general view of 


viously acquired capabilities. This 
new activity so produced is qualita- 
tively different from the tasks which 
correspond to the “old” learning sets; 
that is, it must be described by a dif- 
ferent set of operations, rather than 
simply being “more difficult.” The 
characteristics of tasks which make 
achievement of one class of task the 
required precursor of achievement in 
another, and not vice versa, are yet to 
be discovered. Sufficient examples ex- 
ist of this phenomenon to convince one 
of its reality (Gagné, Mayor, Garstens, 
& Paradise, 1962; Gagné & Paradise, 
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1961). What remains to be done, pre- 
sumably, is to begin with extremely 
simple levels of task, such as discrimi- 
nations, and investigate transfer of 
training to tasks of greater and greater 
degrees of complexity, or perhaps ab- 
stractness, thus determining the dimen- 
sions which make transfer possible. 
The path to research on the charac- 
teristics of instructions appears more 
straightforward, at least at first glance. 
The establishment of identifications is 
a matter which has been investigated 
extensively with the use of paired as- 
The employment of instruc- 
tions for this purpose may need to take 
into consideration the necessity for 
learning differentiations among the 
stimulus items to be identified, as well 
as other variables suggested by verbal 
learning studies. The function of in- 
ducing high recallability would seem to 
be a matter related to repetition of 
learning set tasks, and may in addition 
be related to time variables such as 
those involved in distribution of prac- 
tice. As for guidance of thinking, the 
distinguishing of this function from 
others performed by _ instructions 
should at least make possible the de- 
sign of more highly analytical studies 
than have been possible in the past. 
In the meantime, the approach em- 
ployed in the experiment reported here, 
of proceeding backwards by analysis of 
an already existing task, has much 
to recommend it as a way of under- 
standing the learning of school sub- 
jects like mathematics and science, and 
perhaps others also. Naturally, every 
human task yields a different hier- 
archy of learning sets when this 
method of analysis is applied. Often, 
the relationship of higher to lower 
learning sets is more complex than 
that exhibited in Figure 1. It should 
be possible, beginning with any exist- 
ing class of tasks, to investigate the 
effects of various instruction variables 


sociates. 
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within the framework of suitably de- 
signed learning programs. 


The major methodological implica- 
tion of this paper is to the effect that 
investigations of productive learning 
must deal intensively with the kinds of 
variables usually classified as “indi- 
vidual One cannot de- 
pend upon a measurement of general 
proficiency or aptitude to reveal much 
of the important variability in the capa- 
bilities people bring with them to a 
given task. Consider, for example, the 
seven ninth-grade boys in our study. 
Each of them had “had” algebra, and 
each of them had “had” arithmetic. 
There was no particularly striking rela- 
tionship between their ultimate per- 
formance and their previous grades in 
algebra (although there is no doubt 
some correlation), nor between this 
performance and “general intelligence.” 
Sut the measurement of their learning 
sets, as illustrated in Table 1, revealed 
a great deal about how they would be- 
have when confronted with the learn- 
ing program and the final task. For 
some, instructions had to begin, in ef- 
fect, down” than for others. 
Some could do Task 1 right away, 
while others could not, but could do it 
equally provided they learned 
other things first. The methodological 
point is simply this: if one wants to 
investigate the effects of an experi- 
mental treatment on the behavior of 
individuals or groups who start from 
the same point, he would be well ad- 
vised to measure and map out for each 
individual the learning sets relevant to 
the experimental task. In this way he 
can have some assurance of the extent 
to which his subjects are equivalent. 


differences.” 


“lower 


well 
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The question of whether logic is de- 
scriptive of the thinking process, or 
whether its relation to thinking is nor- 
mative only, seems to be easily an- 
swered. Our reasoning does not, for 
example, ordinarily follow the syllo- 
gistic form; and we do fall into contra- 
dictions. On the other hand, logic 
unquestionably provides criteria by 
which the validity of reasoning may be 
evaluated. Logical forms thus do not 
describe actual thinking, but are con- 
cerned with the ideal, with “how we 
ought to think.” And yet a problem 
seems to be concealed beneath this easy 
solution. 

It is interesting to note that a num- 
ber of the older writers on logic 
regarded their discipline as the science 
of thought. It will not be possible 
here to survey the vast literature, some 
of which bears only by implication on 
our problem, but some examples will 
illustrate this position. Cohen and 
Nagel (1934) summarize the older 
view: “An old tradition defines logic 
as the science of the laws of thought” 
(p. 18). Kant (1885) holds that 
“logic is a science of the necessary laws 
of thought, without which no employ- 
ment of the understanding and the rea- 
son takes place” (p. 3). Psychology, 
on the other hand, supplies only the 
contingent, not the necessary, rules of 
thought. John Stuart Mill (1874) 
likewise views logic as comprising “the 
science of reasoning, as well as an art, 
founded on that science” (p. 18). He 
continues : 

*This paper was written when the author 


was a Fellow of the John Simon Guggen- 
heim Memorial Foundation. 
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Whatever has at any time been concluded 
justly, whatever knowledge has been acquired 
otherwise than by immediate intuition, de- 
pended on the observance of the laws which 
it is the province of logic to investigate. 
If the conclusions are just, and the knowl- 
edge real, those laws, whether known or not, 
have been observed (p. 22). 


Boole (1854), as a final instance, re- 
gards “the laws of the symbols of 
Logic” as “deducible from a considera- 
tion of the operations of the mind in 
reasoning” (pp. 45-46). Boole de- 
duces the law of contradiction, for ex- 
ample, from a fundamental law of 
thought (pp. 49-51). 

More recent writers, on the other 
hand, have tended to reject the view 
that the laws of logic are those of the 
human understanding. Again, only a 
few illustrations will be given. Cohen 


(1944) remarks: 


That the laws of logic are not the uni- 
versal laws according to which we do actually 
think is conclusively shown, not only by the 
most elementary observation or introspection, 
but by the very existence of fallacies (pp. 
2-3). 


A similar point is made by Nagel 
(1956) : 


Little need be said in refutation of the view 
that logical principles formulate the “inherent 
necessities of thought” and are generalized 
descriptions of the operations of minds. 
Surely the actual occurrence of beliefs in 
logically incompatible propositions makes 
nonsense of the claim that the principle of 
noncontradiction expresses a universal fact 
of psychology (p. 66). 


Regarding the work of Boole referred 
to above (An Investigation of the Laws 
of Thought), Cohen and Nagel (1934) 
comment: “The title is a misnomer” 
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(p. 112). 
of this point of view is by 


Russell (1904) : 


Throughout logic and mathematics, the 
existence of the human or any other mind 
is totally irrelevant; mental processes are 
studied by means of logic, but the subject 
matter of logic does not presuppose mental 
processes, and would be equally true if there 
were no mental processes (p. 812). 


A more extreme statement 
Jertrand 


Schiller (1930), too, holds that syllo- 
gistic reasoning “has nothing whatever 
to do with actual reasoning, and can 
make nothing of it” (p. 282). He de- 
scribes the “Laws of Thought” (laws 
of identity, contradiction, and excluded 
middle) as “verbal conventions” (p. 
251). 

The changed point of view with re- 
gard to the relation between logical 
principles and the laws of thought 
seems to be a function of an altered 
intellectual climate rather than of any 
fundamental discoveries about the na- 
ture of reasoning. It thus seems worth- 


while to reopen the question, the more 
so since it has implications for a num- 
ber of central issues in the psychology 


of thinking. First, however, it may be 
of interest to examine a few more forms 
in which the present question has been 
raised. 

Discussion of the figures of the syllo- 
gism has at times centered on their 
relevance to actual thinking. J. N. 
Keynes (1887) cites several writers 
who reject the fourth figure because 
they hold that we do not actually rea- 
son in it (pp. 230-231). Kant (1885), 
indeed, finds all but the first figure both 
useless and false; the fourth in particu- 
lar he calls unnatural (pp. 84-90). 
Keynes, on the other hand, argues for 
the admission of Figure 4 on the same 
grounds, namely its relevance to actual 
thinking: “It is not actually in frequent 
use, but reasonings may sometimes not 
unnaturally fall into it” (p. 232). 

Psychologists investigating reasoning 
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processes have tended to underempha- 
size the role of logic in the thinking of 
their subjects. To illustrate, Bruner, 
Goodnow, and Austin (1956) suggest 
that 


much of human reasoning is supported by 
a kind of thematic process rather than by an 
abstract logic. The principal feature of this 
thematic process is its pragmatic rather than 
its logical structure (p. 104). 


Individuals tend to prefer “empirically 
reasonable propositions” to logical ones 
(p. 104). Morgan and Morton (1944) 
conclude that : 


A person is likely to accept a conclusion 
which expresses his convictions with little 
regard for the correctness or incorrectness 
of the inferences involved. Our evidence will 
indicate that the only circumstance under 
which we can be relatively sure that the in- 
ferences of a person will be logical is when 
they lead to a conclusion which he has 
already accepted (p. 39). 


Lefford (1946) states that the princi- 
ples of logical inference “‘are techniques 
which are not the common property of 
the unsophisticated subject” (p. 144). 
He goes so far as to distinguish from 
the logical inference 


psychological inferences which may be made 
by the ordinary person A psychological 
inference is not valid or invalid except when 
judged as a logical inference: psychological 
inference is purely a fact (p. 145). 


Common to all these statements by 
the psychologists is the assumption that 
logical principles are irrelevant, if not 


antithetical, to much actual reasoning.” 
This conclusion is derived from the 


2A different, but related, position is that 
represented by Dollard and Miller (1950), 
who hold that being logical is a learned drive. 
The child, it is argued, is punished for logi- 
cal contradictions and absurdities, for il- 
logical and contradictory plans. The result 
for most people is “a learned drive to make 
their explanations and plans seem logical” 
(p. 120). The implication is, of course, that 
without this specific training the individual’s 
thinking would not be (or seem) logical. 
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high incidence of wrong inferences of 
subjects under test conditions, espe- 
cially in the case of emotionally relevant 
material. A different conclusion has 
been drawn by Von Domarus (1944) 
from observation of errors in the rea- 
soning of schizophrenic patients. This 
author argues, not that the reasoning of 
his subjects is unrelated to logic, but 
rather that it conforms to a logic whose 
laws are different from those of Aristo- 
telian logic. This is “paralogic,” which 
excludes the law of contradiction and 
“accepts identity based upon identical 
predicates” (p. 111). This idea has 
been elaborated by Arieti (1955) who 
sees the operation of this logic (which 
he calls paleologic) not only in schizo- 
phrenic and primitive thinking, as Von 
Domarus does, but also in dreams, in 
some infantile thinking, and in the 
transference situation in psychoanaly- 
sis. 

Again it is of interest to find that 
the earlier writers mentioned 
were equally aware of the problem of 
error, but viewed it in a way that was 
entirely compatible with their concep- 
tion of logic as the science of the laws 
of the mind. “It is easy to see how 
truth is possible,” writes Kant (1885, 
p. 44) “since in it the understanding 
acts according to its own essential 
Error, however, is difficult to 
“a form 


above 


laws.” 
understand since it constitutes 
of thought inconsistent with the under- 
standing.” Its source is thus not to be 
sought in the understanding itself, but 
rather in the “unobserved influence of 
the sensibility on the understanding,” 
the sensibility being that faculty which 
“supplies the material for thought.” 
Boole (1854) likewise considers that 
“the phaenomena of incorrect reason- 
ing or error . . . are due to the inter- 
ference of other laws with those laws 
of which right reasoning is the prod- 
uct” (p. 409). He reminds us that 
“the laws of correct inference may be 
violated, but they do not the less truly 
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exist on this account” (p. 408). Mill 
(1874) is still more explicit. Discuss- 
ing fallacies of ratiocination, he points 
out that since 


the premises are seldom formally set out,... 


it is almost always to a certain degree 
optional in what manner the suppressed link 
shall be filled up. . . . [A person] has it 
almost always in his power to make his 
syllogism good by introducing a false prem- 
ise; and hence it is scarcely ever possible 
decidedly to affirm that any argument in- 
volves a bad syllogism (p. 560). 


In the case of arguments consisting 
not of a single syllogism but of a chain 
of syllogisms, he considers the common- 
est fallacy of ratiocination to lie in a 
changing of the premises as the argu- 
ment proceeds. 

Two clearly contrasting alternatives 
thus present themselves: Is logic (or 
Aristotelian logic) largely irrelevant to 
the thinking process, or is it concerned 
with the laws of thinking? Since we 
will here be concerned only with deduc- 
tive reasoning, we may reformulate the 
question more specifically in 
terms. But since, as has so often been 
pointed out, the premises from which 
we reason are commonly not spelled 
out, since our inferences so frequently 
appear as enthymemes,* this fact must 
be taken into consideration. We may 
ask: If we know the premises—tacit 
as well as explicit—from which a per- 
son reasons, can we put the process in 
syllogistic form? Do the rules of the 
syllogism describe processes that the 
mind follows in deductive reasoning, 
even when the syllogistic form is not 
explicitly employed? 

It has been shown above that the 
existence of error has been used as evi- 
dence for the irrelevance of logic to the 
actual thinking process. On the other 
hand, a different interpretation of error 


these 


8“A syllogism that is incompletely stated, 
in which one of the premises or the conclusion 
is tacitly present but not expressed, is called 
an enthymeme” (Cohen & Nagel, 1934, p. 
78). 
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has been suggested. Since the problem 
of error seems to be a particularly fruit 
ful one in which to join the issues 
before us, it will be taken as the con- 
text for the present discussion. Once 
more we may reformulate our question 
as follows: Do errors in deductive rea- 
soning mean that the logical process has 
been violated? As Mill expresses it, 
does the occurrence of error mean that 
the syllogism is a bad one? Or can 
the error be accounted for otherwise? 
Is it possible that a process that would 
follow the logic if it were 
spelled out is discernible even when the 


rules of 


reasoning results in error? 

The distinction being made here is 
a familiar one in the psychology of 
learning and thinking. Thus Koffka 
(1935, Ch. 12) distinguishes between 
learning as accomplishment and the 
learning processes responsible for this 
accomplishment. Kohler’s (1927) dis- 
tinction “good and 
stunid ones is likewise relevant. Good 
errors, he points out, “may, in a cer- 
tain sense, be absolutely appropriate to 
the situation” (p. 217), although they 
solve the problem no more than do 
Wertheimer (1959), too, 
between 
tained by “blind” procedures and “fine, 
genuine solutions.” Again the differ- 
ence is one of process, since in both 


between errors” 


stupid ones. 


distinguishes solutions ob- 


cases the result may be the same. In 
the same way, in connection with the 
present problem, we may ask: Given 
contrasting results—correct solutions 
and errors in deductive reasoning— 
what can we say about the thinking 
processes that account for them? Are 
the processes necessarily different be- 
cause their effects are different ? 
Illustrative data that 
issue will be presented 


this 
They were ob- 
tained from 46 graduate students of 
psychology who were asked to evaluate 


bear on 


the logical adequacy of deductions pre- 
sented in the context of everyday prob- 
lems. Most of the subjects had no 
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training in formal logic.‘ The material 
was presented under group conditions, 
the subjects writing out their judg- 
ments and their grounds for making 
them. Instructions included an explicit 
statement that the logical adequacy of 
the arguments was to be judged, not 
the truth of the statements. 

Individual interviews were also con- 
ducted with a different set of subjects. 
Although the results agree generally 
with those obtained under group condi- 
tions, they will not be presented because 
of a problem already mentioned. Mill, 
it has been seen, pointed to the difficulty 
of recognizing a bad syllogism because 
a person can easily introduce into his 
incompletely stated argument a new 
premise that will make the inference 
valid. It was often difficult to decide 
whether the material elicited by direct 
questioning consisted of new premises 
not entering into the original inference, 
or whether the subject was, as intended, 
making clear his original understanding 
of the premises. Although the group 
data are less rich than the individual 
interviews, and although the products 
of written communication are likely to 
be less spontaneous reflections of the 
thinking process than interviews, they 
exclude this possibility of overlooking 
true fallacies ; indeed, if anything, they 
overestimate the incidence of bad syllo- 
gisms, since the reasoning is often in- 
completely described in the case of 
apparent errors.® 


The present data will be used only to 
illustrate the reasoning processes in 
cases of error; no quantitative results 


4It must be recognized, of course, that the 
lack of formal training in logic is insufficient 
guarantee of the naiveté of subjects. It can 
only be pleaded that we have no better cri- 
terion at the present time. Whatever the in- 
formal self-education of subjects in this re- 
spect, the results to be presented cannot be 
regarded as the product of formal instruction 

in logic. 
A sample interview may be found in 


Henle (1958). 
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will be presented. As many authors 
have shown, the incidence of error in 
deductive reasoning depends on the 
form of the syllogism and its contents, 
as well as on instructions to the sub- 
jects. Quantitative results would have 
relevance only to the particular condi- 
tions studied here, whereas an inquiry 
into the nature of the errors obtained 
might be of more general interest. 

Several may be distin- 
guished which led to error in dealing 
with the presented material: 


processes 


ro AccEPT THE LOGICAL 


TASK 


I’ AILURI 


More specifically, this means failure 
to distinguish between a conclusion that 
is logically valid and one that is fac- 
tually correct or one with which the 
subject This source of error 
has already been reported by Henle 
and Michael (1956, p- 124). 

A sample syllogism follows, along 
which errors occur 


agrees. 


with responses in 
because of failure to grasp or accept the 
logical task. 

Syllogism 6. A group of 
were discussing their household prob 
lems. Mrs. Shivers broke the ice by 
saying: “I’m so glad we’re talking 
It’s so impor- 


women 


about these problems. 
tant to talk about things that are in 
our minds. We spend so much of our 
time in the kitchen that of course house- 
hold problems are in our minds. So it 
is important to talk about them.” (Does 
it follow that it is important to talk 
about them? Give your reasoning. ) 

Responses: “The conclusion does not 
follow. The women must talk about 
household problems because it is im- 
portant to talk about their problems, 
not because the problem is in their 
minds.” 

“No. It is not important to talk 
about things that are in our minds un- 


less they worry us, which is not the 


case. 
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“No. Just because one spends ‘so 
much time’ in the kitchen it does not 
necessarily follow that household prob- 
lems are ‘in our minds’.” 

It should be noted that subjects 
who failed to accept the logical task 
frequently gave correct responses that 
are just as irrelevant to the question of 
the relation of logic to the thinking 
process as are the errors just cited. A 
few examples follow : 

“Yes. It could be very important to 
the individual doing the talking and 
possibly to some of those listening, be- 
cause it is important for people to ‘get 
a load off their chest,’ but not for any 
other reason, unless in the process one 
or the other learns something new and 
of value.” 

“Yes. It seems obvious that prob- 
lems which are in the forefront of one’s 
mind bring more consideration to them 
and possibly newer aspects when they 
are discussed with another. Two heads 
may be better than one.” 

“Yes it does. By talking household 
problems, a problem can be solved or 
worked through.” 

The errors illustrated here clearly do 
not demonstrate an inability of the sub- 
jects to reason logically, since they have 
not accepted the logical task. They 
have evaluated the content of the con- 
clusion, not the logical form of the 
argument. Richter (1957, p. 341) 
makes the same distinction, carrying 
the analysis a step farther. Apart from 
careless and those 
from “imperfections in the classifying 
operation” —i.e., from an inability to 
make logical deductions, he describes 
errors arising from “a general failure 
to grasp the concept of ‘logical va- 
lidity’” and those arising from “the 
specific inability to differentiate ‘logical 
validity’ from another attribute of syl 
logisms,” namely their factual status. 
To apply to the present data, the factual 
criterion needs to be interpreted to 


mistakes arising 





Tue RELATION BETWEEN LoGic AND THINKING 


mean what is believed to be true or 
reasonable as well as what is known 
to be true. With Richter’s distinction 
in mind, we may note that no subject 
failed completely to grasp the logical 
task. Still, one individual judged on 
the basis of belief rather than logical 
validity in the case of 9 out of 10 syl- 
logisms, and several others judged in 
this manner with at least half of the 
syllogisms presented. 


RESTATEMENT OF A PREMISE OR 
CONCLUSION SO THAT THE 
INTENDED MEANING IS 
CHANGED 


In a number of cases examination 
of the material from which a subject 
reasoned showed that it differed from 
that which was originally presented. 
In such cases the validity of the argu- 
ment can, of course, be judged only in 
relation to the syllogism actually em- 
ployed, not the one intended. 

1. In the case of syllogisms in which 
a conclusion from two particular prem- 
ises was to be evaluated, the premises 
were occasionally restated as univer- 
from which the fol- 
lowed. The following syllogism per- 
mitted this kind of change: 

Syllogism 8. Mrs. Cooke had stud- 
ied home economics in college. “Youth 
is a time of rapid growth and great 
demands on energy,” she said. “Many 
youngsters don’t get enough vitamins 
in their daily diet. And 
vitamin deficiencies are dangerous to 
health, it that the health of 
many of our youngsters is being en- 
dangered by inadequate diet.” (Does it 
follow that the health of many young- 
sters is being endangered by inadequate 
diet? Give your reasoning. ) 

Responses: “Yes. Youngsters (A) 
don’t get vitamins (B). Not getting 
vitamins (B) dangerous to health (C). 
Youngsters (A) in danger of poor 


health (C).” 


conclusion 


sals, 


since some 


follows 
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“This follows. .. . Youth doesn’t get 
enough vitamins. Vitamins are neces- 
sary or bad health results. Therefore 
youth are endangered. All of these 
follow logically, and I believe a correct 
inference has been made.” 

2. Other changes were introduced 
into the premises by subjects whose 
reasoning was found to be correct even 
though their answers to the intended 
problem were wrong. The same syllo- 
gism (Syllogism 8) will be used to 
illustrate. 

Responses: “There is some question 
about the equivalence of ‘don’t get 
enough vitamins’ and a vitamin defi- 
ciency dangerous to health. If one 
assumes their equivalence, then the con- 
clusion just about manages to follow.” 

“Correct if we assume that the 
youngsters are lacking those vitamins 
in their diet which endanger health.” 

“Tt seems to follow, assuming that the 
deficient vitamins are the vital 
ones.” 

Everyday discussions permit greater 
scope for such changes in meaning than 
does the material employed here. Pri- 
vate meanings of terms or idiosyn- 
cratic or unconscious equations of con- 
cepts may enter in to prevent two 
people from drawing the same conclu- 
sion even though they start ostensibly 
from the same material. 


also 


OMISSION OF A PREMISE 


Occasionally a subject employed only 


one of the presented premises. The 
informal manner in which the premises 
were set out made possible this treat- 
ment of the material. Syllogism 6 (cf. 
above) will be used to illustrate. 

Responses: “Not correct logically. 
(a) A group of women spend much 
of their time in the kitchen; (6) thus 
household problems occupy their minds. 
... (c) Therefore it is important to 
talk about them. Mrs. Shivers’ sub- 
jective thinking.” 
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“Doesn't follow from the preceding 
statement. Because she spends a lot 
of time in the kitchen and household 
problems are on her mind, it doesn’t 
follow that it is important to talk about 
them.” 

In a few cases, indeed, both of the 
intended premises were omitted by the 
subject. For example (Syllogism 6) : 

“No it doesn’t follow. The fact that 
she spends much time in the kitchen 
has nothing to do with whether or not 
it is important to talk about the prob 
lems.” 

It will be seen that in all these cases 
in which a premise has been omitted, 
the subject correctly reports that the 
conclusion does not follow. If we dis- 
regard the intended premises and con- 
sider only the material actually used 
in reasoning, the subjects’ deductions 
are seen to be correct. 


SLIPPING IN OF ADDITIONAL 
PREMISES 


Under the present conditions this de- 
vice was infrequent, but it seems to 
be more common where premises are 
stated less completely, and where the 
issues are of more immediate practical 
concern, as in everyday discussion. 
For example, premises may be added 
that are so commonplace, so much taken 
for granted, that they escape attention. 

Instances of the adding of assump- 
tions to our material follow. Since they 
are readily understandable, and for the 
sake of brevity, the responses will be 
quoted without the presented syllogism. 

Responses: “Without intending 
sophistry it must be pointed out that 
the implicit premise is ‘crime is bad.’ 
If this is granted, the conclusion follows 
that if comics contribute to influence 
the youth along criminal paths, they 
should be eliminated.” 

“  . . If comic books are an evil in- 
fluence, then they should be got rid 
of.” Here the subject is clearly em- 


ploying an implicit premise that he 
added to the material presented, namely 
“Whatever is an evil influence should 
be got rid of.” 

Again in these cases, wrong answers 
are obtained by correct reasoning if we 
consider the syllogism as the subject 
understood it rather than the one the 
investigator hoped to present. 

When all the above processes have 
been taken into account, a considerable 
number of errors remain. Do these 
represent true fallacies? It is difficult 
to say since in these cases either no 
account of the reasoning process is sup- 
plied, or an unclear explanation is 
given, or else the subject merely sets 
out the premises and conclusion given 
to him without comment. It might be 
that many of these errors would have 
been found on examination to involve 
processes similar to the ones described 
above, so that no error in reasoning is 
involved. On the other hand, the pos 
sibility of true fallacies cannot be ex- 
cluded. 

In connection with the unexplained 
errors, it is worth noting that almost 
as many correct 
were unexplained or unclearly ex- 
plained. The absence of explanation is 
thus no reason to suspect a reasoning 


res] onses as errors 


process of being fallacious. 
It must also be mentioned that the 


processes described above as producing 
apparent errors although the deductive 


process is correct are by no means 
viewed as constituting a complete list. 
They were observed with particular 
materials and under particular condi- 
tions; and it is to be anticipated that 
further research will expand this list. 
Chapman and Chapman (1959), for 
example, have suggested several addi- 
tional processes by which error is pro 
duced in syllogistic reasoning, one of 
which involves a misunderstanding of 
the task different from that described 
here. 
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Despite the limitations of the present 
data, they do show clearly that when 
subjects arrive at apparently invalid 
conclusions, or when they fail to spot 
a fallacy, they often do so because they 
have with materials different 
from those intended or because they 
have undertaken a task different from 
the one intended. In such 


worked 


cases, if 


we consider the materials and task as 
they were actually understood by indi- 
vidual subjects, we fail to find evidence 


of faulty reasoning. It must be con- 
cluded that the presence of error does 
not constitute evidence that the laws 
of logic are irrelevant to actual think- 
ing. The data tend, rather, to support 
the older conception that these laws are 
thinking 


widely discernible in the 


process. 


DISCUSSION 


\n examination of errors in syllo 
gistic reasoning leads to two conclu 
(a) While the possibility of fal- 
lacy often cannot be excluded it is, in 
Mill’s words, 
decidedly to affirm that any argument 
involves a bad syllogism.” (>) Where 
error occurs, it need not involve faulty 


sions: 


“scarcely ever pr »ssible 


function of 
the individual’s understanding of the 
task or the materials presented to him. 
Spinoza (1930) has stated our present 


reasoning, but may be a 


conclusion in more general terms: 


When men make errors in calculation, the 
numbers which are in their minds are not 
those which are upon the paper. As far as 
their mind is concerned there is no error, 
although it seems as if there were, because 
we think that the numbers in their minds are 
those which are upon the paper. If we did 
not think so, we should not believe them to 
be in error. For example, when I lately 
heard a man complaining that his court had 
flown into one of his neighbour’s fowls, I 
understood what he meant, and therefore 
did not imagine him to be in error. This is 
the source from which so many controversies 
arise—that men either do not properly ex- 
plain their own thoughts, or do not properly 
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interpret those of other people; for, in truth, 
when they most contradict one another, they 
either think the same things or something 
different, so that those things which they 
suppose to be errors and absurdities in 
another person are not so (from the Ethics, 
Scholium Proposition 47, Part 2). 


Such a conclusion, which the present 
analysis supports, is of some import- 
ance if, as Mill (1874) writes, “to 
draw inferences has been said to be 
the great business of life” (p. 21). 
Our data suggest that the intelligent 
adult is not so inept at carrying on this 
business as several recent 
have made him out to be. 

In considering the place of logical 
inference in everyday life, it is neces- 
sary also to take into account what 
Aristotle (1945) has termed the “prac- 
tical syllogism,” the case in which 


accounts 


the conclusion drawn from the two premisses 
becomes the action. For example, when you 
conceive that every man ought to walk and 
you yourself are a man, you immediately 
walk; or if you conceive that on a particular 
occasion no man ought to walk, and you 
yourself are a man, you immediately remain 
at rest. In both instances action follows 
unless there is some hindrance or compulsion. 
.. . Again, I need a covering, and a cloak 
is a covering, I need a cloak. What I need 
I ought to make; I need a cloak, I ought to 
make a cloak. And the conclusion “I ought 
to make a cloak” is an action. The action 
results from the beginning of the train of 
thought. If there is to be a cloak, such and 
such a thing is necessary, if this thing then 
something else; and one immediately acts 
accordingly. That the action is the con- 
clusion is quite clear. . . . (70la). 


Aristotle points out that this whole 
process need not be explicit. 


The mind does not stop and consider at all 
one of the two premisses, namely, the obvious 
one; for example, if walking is good for a 
man, one does not waste time over the 
premiss “I am myself a man.” Hence such 
things as we do without calculation, we do 
quickly (70la). 


Again, “My appetite says, I must 
drink ; this is drink, says sensation or 
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imagination or thought, and one im- 
mediately drinks.” 

If we include, as it seems we must, 
the practical syllogism in the “great 
business of life,” that of drawing in- 
ferences, it becomes difficult incleed to 
accept the conclusion that human 
beings are unable to reason logically. 
A couple of contemporary examples 
will suggest how ubiquitous is the 
practical syllogism in everyday life. 
“It is too far to walk to the Public 
Library ; I must take a subway or bus. 
The Fifth Avenue bus passes the 
Public Library. I must take the Fifth 
Avenue bus.” “I do not want to wear 
the same dress two days in succession. 
I wore this dress yesterday; so I do 
not want to wear this dress today.” It 
is difficult to see how the individual 
could cope with the ordinary tasks of 
life if the practical syllogism embodied 
techniques which are not, as one author 
quoted above put it, “the common 
property of the unsophisticated sub- 
ject,” if, indeed, it were not a natural 
mode of functioning of the conscious 
mind. 

Furthermore, if people were unable 
to reason logically, so that each arrived 
at different conclusions from the same 
premises, it is difficult to see how they 
could understand each other, follow 
one another’s thinking, reach common 
decisions, and work together. 

It might be argued that in the case 
of the practical syllogism we are deal- 
ing not with an implicit logical process, 
but with a learning process in which 
each response is cued off by the pre- 
ceding one. There seem to be at least 
two grounds for rejecting such an in- 
terpretation. (a) There are cases in 
which the practical syllogism leads to 
a solution which is genuinely novel for 
the individual. This is surely so for 
some of the examples cited above, at 
least the first time they occur. Indeed, 
if the responses were habitual ones, the 
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reasoning process would not be neces- 
sary. (b) Our data obtained from 
conventional syllogisms suggest that a 
logical process often occurs even in 
cases in which the reasoning seems 
fallacious. Such data can thus not be 
interpreted in terms of the “learned 
drive” postulated by Dollard and 
Miller (1950, p. 120) “to make 
explanations and plans seem logical.” 
And since they sometimes lead to 
novel conclusions (cf. Wertheimer, 
1938), they cannot be interpreted as 
habitual responses. In the interests of 
parsimony, therefore, a common ex- 
planation for the practical syllogism 
and the verbal one seems preferable. 
An interpretation in terms of an im- 
plicit logical process would fit both 
kinds of case. 

Two further implications of the 
present data may be mentioned. They 
suggest an approach to the problem of 
the influence of needs and attitudes 
on the reasoning process. For the 
problem remains: why do motivational 
influences so often appear to impair 
thinking? It is a plausible hypothesis 
that these influences do not distort the 
reasoning process, as has frequently 
been stated or implied—indeed that 
they do not act at all on the reasoning 
process—but rather that they affect the 
materials with which thinking works. 
Although this is not the place for a 
detailed analysis, one or two specific 
effects of this kind may be suggested. 
(a) It may be that a strong attitude 
toward, or emotional involvement with, 
particular material is in part respon- 
sible for the difficulty which many un- 
sophisticated subjects experience in 
distinguishing between drawing a con- 
clusion that is logically valid and one 
that is believed to be correct. Over a 
wide range of practical situations these 
two tasks are not in conflict; and the 
distinction is thus not an easy’ one. 
There is evidence that attitudes and 
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emotions may limit the ability to make 
distinctions (cf. Henle, 1955). If this 
suggestion is correct, the more per- 
sonally relevant the material employed, 
the more difficult it will be to accept 
the logical task. Such an effect might 
go a long way toward explaining the 
report of a number of writers that in- 
dividuals are unable to reason logically 
about emotionally toned material. (b) 
An attitude can select from among the 
possibilities that the material presents, 
singling out, for example, one among 
several possible meanings. It may be 
that such a process is responsible for 
the changes in the meaning of premises 
observed in our data. (For possibili- 
ties of further analysis of this kind, cf. 
Henle, 1955.) 

That motivational processes may act 
on reasoning in the manner suggested 
here receives support from a recent 
study by Kopp (1960), who subjected 
to a similar analysis reasoning proc- 
related to the delusions of 
His subjects, 
like the present ones, made errors 
which were found to lie more in the 
premises from which the reasoning 
proceeded than in the actual drawing 
of inferences. 

In this connection, it would be in- 
teresting to analyze Arieti’s (1955) ex- 
amples of “paleological” thinking in the 
terms of the present study. Unfortu- 
nately, the published data are too scant 
to permit such an analysis. The ex- 
amples presented would be fallacies in 
terms of Aristotelian logic if one could 
assume that the premises as understood 
by the subject are the same as those 
understood by the examiner. That this 
is the case may be doubted in a number 
of instances; and the search for an 
archaic logic thus seems premature. 
Some examples will be given which 
seem to call for an analysis different 
from Arieti’s. 

Arieti cites the case of a woman who 


esses 


paranoid schizophrenics. 
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“needed to identify herself with the 
Virgin Mary because of the extreme 
closeness and spiritual kinship she felt 
for the Virgin Mary” (1955, p. 195). 
The reasoning process is described as 
follows: “The Virgin Mary was a 
virgin; I am a virgin; therefore, I am 
the Virgin Mary.” In the absence of 
additional material, we may only 
speculate that the proposition “I am 
the Virgin Mary” was not a conclusion 
deduced from the stated premises, but 
that it belongs rather in the category 
of revealed truth or intuition, not in 
that of deductive inference. It seems 
likely to the present writer that the 
syllogism quoted above was in the 
nature of a justification of this intui- 
tion to the examiner rather than the 
grounds for belief in it. The argument 
itself could be evaluated only in the 
light of additional information. 
Another kind of example of Arieti’s 
(1955) illustrates what he considers to 
be the propensity of young children 


, 


“to indulge in paleologic thinking” : 


A girl, three years and nine months old, 
saw two nuns walking together, and told her 
mother, “Mommy, look at the twins!” She 
thought that the nuns were twins because 
they were dressed alike. The characteristic 
of being dressed alike, which twins often 
have, led to the identification with the nuns 
(p. 199). 


Again a different interpretation sug- 
gests itself: rather than viewing it as 
a deduction, we may take this process 
more simply as an instance of an in- 
accurate concept. Arieti also discusses 
dream symbols as examples of paleo- 
logic thinking. It is particularly dif- 
ficult for the present writer to view 
such symbols as inferences, as if they 
were the conclusions of processes of 
deductive reasoning. They seem, 
rather, to involve a much more im- 
mediate grasping of similarities. 

It is amusing to note that Arieti 
(1955) regards some delusions of 
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identification as instances of paleologic, 
and describes the “formal mechanism” 
by which they are produced in a way 
that could easily be restated as a valid 
Aristotelian syllogism : 


If A may be identified with B because 
they have a common quality, it will be suf- 
ficient for me to acquire a quality of the 
person I want to be identified with, in order 
to become that person (p. 207). 


Further discussion of the problems 
raised by Arieti must await more 
complete data. 

The present study has implications 
also for another problem. To label an 
inference fallacious, as is clear from the 
analysis of this paper, is to make a 
statement about the results of reason- 
ing, but not about the process that is 
responsible for these results. It might 
be a matter of conside.able interest to 
study systematically and in detail the 
reasoning involved in various fallacies. 
It might be found that many fallacies 


are produced, not by faulty reasoning, 
but by specific changes in the material 


as the subject understands it. Some 
of the syllogisms employed here in- 
volved the fallacy of the undistributed 
middle ; here changes of premises from 
particulars to universals and other 
changes in the meaning of premises 
were not uncommon. Two of the 
arguments presented contained illicit 
conversions, but no relevant data were 
obtained since the subjects systemati- 
cally ignored the conversion and cen- 
tered their attention on other parts of 
the material. In connection with il- 
licit conversions, the discussion of 
Chapman and Chapman (1959) is of 
interest ; these authors suggest implicit 


6 This unexpected finding cannot be attrib- 
uted to an insensitivity of the subjects to 
logical processes. Rather the fault lies with 
the investigator who, in the effort to dis- 
guise the formal structure of the syllogism, 
included irrelevancies that were taken by 
many subjects as the material for evaluation. 
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assumptions and changes in meaning 
that may be involved in the acceptance 
of such conversions. Mill’s (1874) 
observation that fallacies frequently 
involve changing the premises has been 
cited above. He analyzes several ex- 
amples of the fallacy a dicto secundum 
quid ad dictum simpliciter* in these 
terms (p. 561-562). Fallacies of 
ambiguity seem likewise to lend them- 
selves to the kind of analysis suggested 
here.® 

In conclusion, observations have 
been presented which are consistent 


7“This is committed when, in the prem- 
ises, a proposition is asserted with a quali- 
fication, and the qualification lost sight of 
in the conclusion; or oftener, when a limita- 
tion or condition, though not asserted, is 
necessary to the truth of the proposition, 
but is forgotten when that proposition comes 
to be employed as a premise” (Mill, 1874, 
p. 561). 

8It is interesting to note that Mill's 
(1874) view comes close to the hypothesis 
offered here. It is true that he describes 
fallacies of ratiocination as: 


those which have their seat in the rati- 
ocinative or deductive part of the investi 
gation of truth (p. 559). 


However he adds that it is by the rules of 
the syllogism 


that a reasoner is compelled distinctly to 
make his election what premises he is 
prepared to maintain. The election made, 
there is generally so little difficulty in 
seeing whether the conclusion follows 
from the premises set out, that we might 
without much logical impropriety have 
merged this fourth class of fallacies in the 
fifth, or Fallacies of Confusion (p. 560). 


“he latter class includes: 


all those fallacies in which the source of 
error is not so much a false estimate of 
the probative force of known evidence, as 
an indistinct, indefinite, and fluctuating 
conception of what the evidence is (p. 
563). 


(This class includes errors involving ambi- 
guity of terms, begging the question, and 
fallacies of irrelevance.) 
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with the view that the rules of the 
syllogism describe processes that the 
mind follows in deductive reasoning, 
even when the syllogistic form is not 
explicitly employed.* A word of cau- 
tion is necessary at this point. The 
present observations apply to deduc- 
tive reasoning only. Even within this 
category, a very limited number of 
logical forms has been studied ; and the 
study has been concerned only with the 
evaluation of presented arguments, not 
with the construction of arguments of 


one’s own. The investigation needs to 


be extended before the generality of 
the conclusion 
must also be pointed out that there are 
many types of thinking to which logical 
analysis does not apply at all, for ex- 
ample many aspects of free association, 


can be assessed. It 


fantasy, many creative processes. To 
apply logical analysis to such processes 
(for example, Arieti’s discussion of 
dream symbols) is as much an error 
as to ignore the underlying logical 
structure of incompletely stated argu- 
ments. 


SUMMARY 


Two views of the relation of logic 
to thinking have been presented: the 
older view that logic is the science of 
thought, and the position more widely 
held at present that its relation to 
thinking is normative only. It has 
been acknowledged that ordinary 
thinking, more often than not, does not 
assume the form of the syllogism, and 
that errors in reasoning are frequent. 
But does error mean that the logical 
process has been violated? or can we 


If it could be established generally that 
the laws of logic are those of the human 
mind, this would not be to give the former 
the status of “mere facts” or to make them 
relative to particular conditions; rather it 
would be to recognize the requiredness in 
thinking (cf. Kohler, 1938, especially pp. 
45, 52). 
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account for it in other ways? Evi- 
dence has been presented that suggests 
that even where the thinking process 
results in error, it can often be shown 
that it does not violate the rules of the 
syllogism. Many errors were found to 
be accounted for not in terms of a 
breakdown of the deductive process 
itself, but rather in terms of changes 
in the material from which the reason- 
ing proceeds. The problem of the role 
of logical processes in everyday life 
was discussed more generally, and 
some implications of this study for 
other problems were considered. 
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ERRATA 


In an article by Joan E. Jones, “All-or-None versus Incremental 
Learning,” published in Psychological Review, 1962, 69, 156-160, 


errors appeared in the numbering of experiments. 


On page 159 in 


Table 1, and in the last paragraph of the text on page 159 where 
Table 1 is discussed, where Experiment III appears it should be 


read as Experiment I. 


In the table of contents on the front cover of the May 1962 issue, 
tiguity and Reinforcement in Relation to CS-UCS Intervals in 


Classical Aversive Conditioning.” 
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